pygtfs icon indicating copy to clipboard operation
pygtfs copied to clipboard

Load fails on feed dates missing

Open vingerha opened this issue 1 year ago • 4 comments

Source: https://www.data.gouv.fr/fr/datasets/r/47bc8088-6c72-43ad-a959-a5bbdd1aa14f

Failure while writing FeedInfo(feed_publisher_name='Palmbus', feed_publisher_url='https://www.palmbus.fr', feed_lang='fr', feed_start_date=None, feed_end_date=None, feed_version=None)

feed_info.txt contains indeed no dates.

feed_publisher_name,feed_publisher_url,feed_lang,feed_start_date,feed_end_date,feed_version
Palmbus,https://www.palmbus.fr,fr,,,

vingerha avatar Nov 08 '23 13:11 vingerha

Oddly enough, without feed_info.txt...it does continue

vingerha avatar Nov 24 '23 13:11 vingerha

It would be 'nice' to be able to exclude things. For my solution I do not need shapes.txt and by removing this from the zip, the extract time and sqlite size reduce substially. Only....the auto-removal from zip is not that easy, it requires to unpack/repack the lot so although I have a solution, would still be nice to be able to exclude things :)

vingerha avatar Mar 29 '24 08:03 vingerha

That is an interesting feature request. So something like a flag? --ignore_tables=shapes ?

jarondl avatar Apr 13 '24 16:04 jarondl

Something would already be nice... at the moment I repack the gts by excluding feed_info and shapes as they either are erroneous or not providing value (for this ! integration). Repack means: open zip > read file > if OK > add to new zip > repeat till end ... then pygtfs. On a side note, For the netherlands gtfs, this takes hours, would it be an idea to apply conditions afterwards to speed it up?

vingerha avatar Apr 13 '24 17:04 vingerha