estate
estate copied to clipboard
With single command build a beautiful web scraping tool for scheduled scraping and store scraped data in postgres database
Dagster has changed a lot. Update knowledge and redo the pipeline
Update the project to keep up with the new changes in Dagster, and Docker
See https://docs.dagster.io/concepts/resources
Use [MindDB](https://github.com/mindsdb/mindsdb) to train and score models
Introduce Redis to store models Jupyter Lab for modelling Kibana for alerting
Decouple data scraping and sending data to Postgress. Dump data to S3|Minio and have another pipe to pick data from dump to Postgres
Use Pytest to write dags tests. # Code without tests is an illusion
Extra tools needs there own yaml - Redis UI - Grafana - ELK
Use dagfactory [see examples](https://github.com/ajbosco)