Simon Späti
1 min readMar 15, 2020

--

very nice write, thanks so much Joshua Feierman! I went exactly the same path and also wrote some articles about. One here related to #opensource data warehousing here https://www.sspaeti.com/blog/open-source-data-warehousing-druid-airflow-superset/

Two python libraries you need to add to your python code is https://github.com/dagster-io/dagster/ for pipelining and https://github.com/great-expectations/great_expectations for testing your pipelines (it’s integrated into dagster). Check it out, you won’t regret. I believe you don’t want to write your own orchestrator. Or how did you orchestrate so far?

--

--

Simon Späti
Simon Späti

Written by Simon Späti

Data Engineer & Technical Author with 15+ years of experience. I enjoy maintaining awareness of new innovative and emerging open-source technologies.

No responses yet