Raphael Dumas
Raphael Dumas
Thanks for the answer @yuvipanda, this is not my area of expertise, but would be willing to help contribute if pointed in the right direction.
So I think the issue is because the API wasn't working at all Oct 5th, and so there was no `requests` data within a chunk of 2000 files (aka minutes?)....
@tloureiro thanks for catching this! I think I forgot to push code fixing this. I'll try to do so tonight...
Whoops I didn't push a fix and this is still an issue, more recently with 2021-06 data ``` Traceback (most recent call last): File "fetch_s3.py", line 210, in fetch_s3() File...
Hi Bohan, you don't need to have two versions of a notebook in Github, the version control takes care of versioning :)
Thanks friends!
Or we could use the form developed by my friend of use date + interval for the timestamps.
First draft ```sql SELECT DISTINCT ON (pollid, lineid, trainid, traindirection, stationid) lineid, create_date, traindirection, trainid, stationid, timint, train_message FROM ntas_data INNER JOIN requests USING (requestid) INNER JOIN polls USING (pollid)...
The arrival time inference algorithm I built in SQL has some issues. The below graph shows the difference in the number of trips having different lengths (number of stops) for...
@moh-osman is also working on this for headway analysis https://github.com/moh-osman/ttc_subway_times/blob/master/doc/ttc_headway_analysis.ipynb