the_od_bods

Add Stagecoach Open Data as a source

Open JackGilmore opened this issue 3 years ago • 3 comments

On the back of some Twitter enquiries about bus open data I discovered Stagecoach publish their schedules and fares as open data: https://www.stagecoachbus.com/open-data

As these are just file downloads linked from a page, we'll need to write a scraper for this.
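As a rough starting point for that scraper, here is a minimal sketch that pulls every `.zip` link out of the page's HTML using only the standard library. It assumes the downloads are plain `<a href="...">` links ending in `.zip`, which has not been checked against the live page. `ZipLinkParser` and `find_zip_links` are hypothetical names, not existing code in this repo.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Page named in this issue; the link layout on it is an assumption.
PAGE_URL = "https://www.stagecoachbus.com/open-data"


class ZipLinkParser(HTMLParser):
    """Collects the href of every <a> tag that points at a .zip file."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the file extension.
        if href and href.lower().split("?")[0].endswith(".zip"):
            # Resolve relative links against the page URL.
            self.links.append(urljoin(self.base_url, href))


def find_zip_links(html, base_url=PAGE_URL):
    """Return absolute URLs for all .zip links found in the given HTML."""
    parser = ZipLinkParser(base_url)
    parser.feed(html)
    return parser.links
```

Fetching the HTML is left out on purpose, so the parsing can be tested against a saved copy of the page without hitting the site.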

Considerations

  • What is the license for these? It doesn't appear to be explicitly stated. It may be worth getting in touch with Stagecoach via the email address on the page to ask them.
  • The file downloads themselves are just zip files, split up by region, that contain XML files
    • Should we consider unzipping these files and serving the individual XML files?
    • The file downloads cover regions outside Scotland (e.g. England and Wales). Should we include these?
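If we do decide to unzip, a minimal sketch of pulling the individual XML files out of one regional archive could look like the following. It works on the downloaded bytes in memory and assumes nothing about file naming beyond a `.xml` extension. `extract_xml_members` is a hypothetical helper name.

```python
import io
import zipfile


def extract_xml_members(zip_bytes):
    """Return {member_name: xml_bytes} for each .xml file in the archive."""
    members = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            # Skip directory entries and anything that is not XML.
            if info.is_dir():
                continue
            if info.filename.lower().endswith(".xml"):
                members[info.filename] = archive.read(info)
    return members
```

Whether to include regions outside Scotland could then be a filter applied to the list of archives before this step, once we know how the regions are named on the page.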

JackGilmore avatar Aug 04 '22 13:08 JackGilmore

The page states that the data is

available to the public, for personal, educational or commercial use

so we should be fine to publish this ourselves.

Furthermore, the data is auto-updating, i.e. it changes when fares get updated, so we should really make a wee scrape script to refresh this data every so often.
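One way the periodic update could avoid republishing unchanged files is to compare a hash of each download against the hash from the previous run. This is only a sketch of that idea: where the previous digest is stored and how often the script runs are left open, and `content_digest` and `has_changed` are hypothetical names.

```python
import hashlib


def content_digest(data):
    """Return the SHA-256 hex digest of the downloaded bytes."""
    return hashlib.sha256(data).hexdigest()


def has_changed(data, previous_digest):
    """True if there is no earlier digest or the content differs from it."""
    return previous_digest is None or content_digest(data) != previous_digest
```

Note that an archive may be regenerated with new timestamps even when the XML inside is identical, so hashing the extracted XML files rather than the zip itself might be the more reliable choice.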

johnnymck avatar Sep 10 '22 12:09 johnnymck

Sweet! I think the best way to split these up would be to have a dataset per region and then have the individual files with the schedules and fares attached to the dataset e.g.

JackGilmore avatar Sep 10 '22 13:09 JackGilmore

@johnnymck did you manage to get anything started on this that you could commit to a branch/fork?

JackGilmore avatar Sep 12 '22 09:09 JackGilmore