Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.

arxiv.org

Jacopo Tagliabue Reproducible data science over data lakes: replayable data pipelines with Bauplan and Nessie.