libpysal
refactor examples
We've had some recent issues where filenames in some remote datasets were changed, silently breaking access to some of the data. We want to restructure the examples a bit, as follows:
- borrow design ideas (and possibly hook into) pooch and intake
- define datasets as a collection of YAML files (so users can also contribute datasets by simply defining the appropriate fields in a YAML file)
- parse the geoda site using current machinery and build those yamls on the fly
- include CI tests that examine whether remote datasets can be accessed (and optionally check for cols/rows)
- fix pandas rendering of available datasets
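
As a rough illustration of the CI idea above, here is a minimal sketch of a catalog entry and an accessibility check. Everything in it is an assumption made for illustration: `RemoteDataset`, its fields, `http_status`, and `find_broken` are hypothetical names, not existing libpysal, pooch, or intake API, and the fields only mirror what a YAML definition might declare. The status lookup is injectable so the check can be tested without network access.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional
import urllib.request


@dataclass
class RemoteDataset:
    """One catalog entry (hypothetical); fields mirror a possible YAML file."""
    name: str
    url: str
    n_rows: Optional[int] = None  # optional expected row count
    n_cols: Optional[int] = None  # optional expected column count


def http_status(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status of a HEAD request (needs network access)."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


def find_broken(
    datasets: Iterable[RemoteDataset],
    get_status: Callable[[str], int] = http_status,
) -> List[str]:
    """Return the names of datasets whose URL does not answer with 200."""
    broken = []
    for dataset in datasets:
        try:
            ok = get_status(dataset.url) == 200
        except OSError:
            # urllib's URLError and HTTPError are OSError subclasses,
            # so unreachable hosts and 4xx/5xx responses land here.
            ok = False
        if not ok:
            broken.append(dataset.name)
    return broken
```

A CI job could then assert that `find_broken(catalog)` is empty, and compare `n_rows` and `n_cols` against the loaded data for entries that declare them.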
a few of the links on geoda's datasets page aren't always as advertised (e.g. "cars.zip" is really "Abandoned_Vehicles_Map.csv" and "CHIM.zip" is really a box.com link), though most of them are accurate
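
One way to flag mismatches like these while building the catalog is to compare the advertised name with the filename the link actually points to. The sketch below is only an assumption about how that could look: `target_filename` and `name_matches` are hypothetical helpers, and the URLs in the usage lines are placeholders, not real GeoDa links.

```python
from pathlib import PurePosixPath
from urllib.parse import unquote, urlparse


def target_filename(url: str) -> str:
    """Return the last path component of a URL, percent-decoded."""
    return PurePosixPath(unquote(urlparse(url).path)).name


def name_matches(advertised: str, url: str) -> bool:
    """True if the advertised name equals the URL's filename (case-insensitive)."""
    return advertised.lower() == target_filename(url).lower()


# Placeholder URLs for illustration only.
print(name_matches("cars.zip", "https://example.invalid/data/cars.zip"))
print(name_matches("cars.zip", "https://example.invalid/data/Abandoned_Vehicles_Map.csv"))
```

Entries that fail this comparison could be listed for manual review rather than rejected, since a differently named file may still be the right data.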
I have made a start on cataloging the remotes with YAML files, so that users can easily add new remote examples. The work is on the catalog branch.