datadex
datadex copied to clipboard
Make it easy to play with the data
Datadex is producing several datasets. Right now, they're all in HuggingFace as static files.
Should be easy for anyone to click a button from the Portal and have an environment with all the datasets ready to be used. Similar to https://github.com/datonic/datadex/issues/32#issuecomment-1801693041.
The ideal solution is:
- Fully static (e.g: DuckDB WASM + Pyodide + Parquet files)
- Can be URL encoded so folks can share what they work on
Reallistically, we can start with a Colab Notebook that grabs all the datasets from HuggingFace.