libpysal icon indicating copy to clipboard operation
libpysal copied to clipboard

refactor examples

Open knaaptime opened this issue 5 years ago • 2 comments

we've had some recent issues where filenames from some remote datasets were changed, unknowingly breaking access to some data. We want to restructure the examples a bit as following:

  1. borrow design ideas (and possibly hook into) pooch and intake
  2. define datasets as a collection of YAML files (so users can also contribute datasets by simply defining the appropriate fields in a yaml
  3. parse the geoda site using current machinery and build those yamls on the fly
  4. include CI tests that examine whether remote datasets can be accessed (and optionally check for cols/rows)
  5. Fix pandas rendering of available datasets

knaaptime avatar Dec 17 '20 17:12 knaaptime

a few of the links on geoda's datasets page aren't always as advertised (e.g. "cars.zip" is really "Abandoned_Vehicles_Map.csv" and "CHIM.zip" is really a box.com link), though most of them are accurate

knaaptime avatar Dec 17 '20 19:12 knaaptime

I have a start at the cataloging of the remotes with yml files to allow for users to easily add new remote examples. Working on the branch catalog.

sjsrey avatar Dec 21 '20 00:12 sjsrey