Epic: Improved Dagster data replication
What is it?
We have a bunch of amazing Dagster data-producing factories. We are going to need WAY more data than we have today.
This epic covers a rewrite that improves the UX of adding new data sources, local testing, and the infrastructure for how replication runs and is orchestrated.
- Phase 1: We rewrite our factories to support different backends (e.g. DuckDB)
- Phase 2: We support local testing and dynamic seed generation
- Phase 3: Any user on the OSO web app can configure a new data source and monitor its progress
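To make Phases 1 and 2 a bit more concrete, here is a rough sketch of one possible shape: a factory that is agnostic to the backend it writes to, plus seed rows generated from a schema for local runs. Everything in it is an assumption for illustration, not our current code: `Backend`, `InMemoryBackend` (a stand-in for a local backend such as DuckDB), `generate_seed`, and `replication_factory` are hypothetical names, and it deliberately avoids Dagster and dlt APIs.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, Protocol

Row = dict[str, object]


class Backend(Protocol):
    """Hypothetical interface every replication target would implement."""

    def write(self, table: str, rows: Iterable[Row]) -> int: ...


@dataclass
class InMemoryBackend:
    """Stand-in for a local backend such as DuckDB; keeps rows in a dict."""

    tables: dict[str, list[Row]] = field(default_factory=dict)

    def write(self, table: str, rows: Iterable[Row]) -> int:
        stored = self.tables.setdefault(table, [])
        before = len(stored)
        stored.extend(rows)
        return len(stored) - before  # number of rows written


def generate_seed(schema: dict[str, type], count: int) -> list[Row]:
    """Build deterministic seed rows from a column-name -> type mapping."""
    makers: dict[type, Callable[[str, int], object]] = {
        int: lambda col, i: i,
        float: lambda col, i: float(i),
        str: lambda col, i: f"{col}_{i}",
        bool: lambda col, i: i % 2 == 0,
    }
    return [
        {col: makers[typ](col, i) for col, typ in schema.items()}
        for i in range(count)
    ]


def replication_factory(
    table: str, fetch: Callable[[], Iterable[Row]]
) -> Callable[[Backend], int]:
    """Return a job that copies rows from `fetch` into any given backend."""

    def run(backend: Backend) -> int:
        return backend.write(table, fetch())

    return run


# Local test run: seed data in, in-memory backend out.
schema = {"id": int, "name": str}
job = replication_factory("projects", lambda: generate_seed(schema, 3))
backend = InMemoryBackend()
written = job(backend)
```

The point of the split is that the same `job` could be handed a production backend or a local one without the factory changing, which is what would let seeds and local testing fall out of Phase 1 more or less for free.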
@IcaroG can you help fill in sub-issues for this?
@IcaroG it's also worth considering whether we want to support dynamically adding Trino connectors (without replicating via dlt).
Going to deprioritize this for now as discussed, at least until we decide how to move forward on blockchain indexing.
@Jabolol can you triage this? In particular, we want to revisit writing from Dagster straight to Iceberg.