Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

arxiv.org

Reflections on Palantir

nabeelqu.conabeelqu.co

Data composability: what it is + why it matters

Danny Zuckermandazuck.substack.com
Thumbnail of Data composability: what it is + why it matters

Data Engineering Data Orchestration Trends: The Shift From Data Pipelines to Data Products

ByteByteGo-Big-Archive-System-Design-2023

Link