
Our 5 favourite open-source customer data platforms

DeSo is custom-built from the ground up to power social applications, and that means that all of the data that it stores and indexes follows a known schema. Profiles are stored and indexed differently than posts, which are stored and indexed differently than follows, etc... This level of customization not only makes storage costs 10,000x cheaper th... See more
Deso • Web3 Will Not Be Built on Smart Contracts • DeSo (Decentralized Social) Blockchain
You can start to look at patterns over years, over seasons, across demographics.” Hadoop, however, is not easy to use and you need a great deal of training and expertise that many companies don’t yet have. Alternatively, you can use an implementation such as Amazon EMR, which removes much of the complexity, as Etsy has done.
Ray Velez • Converge: Transforming Business at the Intersection of Marketing and Technology
- ClickHouse: It's a high performance columnar database that's great for real time queries. It enables querying and storing large amounts of data on commodity hardware. Some of my customers have millions of page views and I don't have an unlimited budget, so it's been very handy.
- PostgreSQL: My favorite database. Sane defaults, battle-tested, and well
The Tech Stack of a One-Man SaaS
Most commonly, ETL means moving data from some source system (e.g. a production database, Slack API) into an analytical data warehouse (e.g. Snowflake) where the data is easier to combine and analyze. Most data teams use a vendor like Fivetran or an orchestration platform like Airflow to do this.
Modal is a great solution for ETL if you are primaril... See more
Modal is a great solution for ETL if you are primaril... See more