etl
etl copied to clipboard
M-Lab ingestion pipeline
A 2019-12-20T16:02:30Z 2019/12/20 16:02:30 insert.go:273: tcpinfo googleapi: Error 403: retry failed with context deadline exceeded; last error: Exceeded rate limits: Your table: mlab-sandbox:batch.tcpinfo_20191218 exceeded quota for streaming insert bytes per...
Part of m-lab/dev-tracker#430 KR: Simplify Gardener, ETL Tuning, and New Parser Development Also see #732 As parser handles each tar file, it should report back to Gardener the partition dates...
Part of m-lab/dev-tracker#501
Part of m-lab/dev-tracker#501
Part of m-lab/dev-tracker#501
The universal parser uses a private network, so it can make requests to gardener. This breaks the prometheus scraping. Probably interferes with GCE discover, and may also make the instances...
Including, but not limited to: server hostname, machine, site, and metro names. Migrated from https://github.com/m-lab/ndt-server/issues/206 EDIT(soltesz): Additionally, our core data sets (ndt, tcpinfo, pt, utility) should use _consistent_ top level...
NDT5 results are not annotated with client geolocation data. This makes impossible lots of analyses we want to do.