World airports dataset

Open edschofield opened this issue 7 years ago • 1 comment

Nice work on vega_datasets and altair! 😃

It would be great for the entire world airports dataset to be included in vega_datasets, not just a subset for those in the USA. It would make for many more interesting visualization possibilities, starting with this:

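import altair as alt

# world_airports is assumed to be a pandas DataFrame of the full dataset;
# the [:5000] slice keeps the chart under Altair's default row limit.
alt.Chart(world_airports[:5000]).mark_bar().encode(
    x='count()',
    y='Country:N'
)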

and then filtering by country, timezone etc.

This dataset is around 8000 rows, which would also serve the useful purpose of demonstrating how to handle datasets longer than Altair's default limit of 5000 rows. This limit is likely the first hurdle most people using Altair for real datasets will have to surmount. (I'd also gladly volunteer to help to make handling of large datasets more seamless in Altair...)
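
One possible way around the limit for a chart like the one above is to aggregate before plotting, so that the chart receives one row per country rather than one row per airport. The sketch below uses only the Python standard library; `count_by_country` and `sample_rows` are hypothetical names and made-up data for illustration, not part of vega_datasets or Altair, and the `Country` key simply mirrors the column used in the chart above.

```python
from collections import Counter

MAX_ROWS = 5000  # Altair's default row limit, as described above


def count_by_country(rows, key="Country"):
    """Collapse per-airport rows into one row per country (hypothetical helper)."""
    counts = Counter(row[key] for row in rows)
    # Sort by descending count, then by name, so the order is stable.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [{key: country, "count": n} for country, n in ordered]


# Made-up stand-in for the roughly 8000-row airports table.
sample_rows = (
    [{"Country": "Australia"}] * 3000
    + [{"Country": "Brazil"}] * 2500
    + [{"Country": "Canada"}] * 2500
)

aggregated = count_by_country(sample_rows)
```

The aggregated list is far shorter than the limit, so it could be charted with plain `x='count:Q'` and `y='Country:N'` encodings. Charts that genuinely need every row would instead have to lift the limit, for example with `alt.data_transformers.disable_max_rows()` in recent Altair releases.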

Example URLs:

  • https://github.com/datasets/airport-codes/blob/master/data/airport-codes.csv
  • https://raw.githubusercontent.com/ajdapretnar/datasets/master/data/global_airports.csv

edschofield, May 21 '18 02:05

Great suggestion! All datasets accessed by this package come from the vega/vega-datasets repository. Any new datasets should be proposed there.

jakevdp, May 21 '18 03:05