ChangesetMD
Simple XML parser to shove OpenStreetMap changeset metadata dump files into a postgres database
Needs some testing still.
Solves #32
PR for #34
I think it would be a benefit to pin the dependencies, e.g. using [poetry](https://python-poetry.org/). PR incoming.
Upgrade to a newer Python version, e.g. 3.7.
Using `psycopg2.extras.execute_batch()` and inserting data in batches of 100,000 changesets will speed up the import. On my machine it was ~2x faster than the original method of inserting data line...
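A minimal sketch of the batching idea, not the code from the PR. The table name `changeset_sketch`, its columns, and the helper names are hypothetical; `execute_batch` is the real psycopg2 call, imported lazily so the chunking helper can be tried without a database.

```python
from itertools import islice


def batched(rows, size):
    """Yield lists of at most `size` items from any iterable."""
    it = iter(rows)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def insert_changesets(conn, rows, batch_size=100_000):
    """Send rows to the database one batch at a time; return the row count.

    `conn` is assumed to be an open psycopg2 connection. The table and
    column names below are hypothetical, chosen only for illustration.
    """
    # Imported here so that batched() above works without psycopg2 installed.
    from psycopg2.extras import execute_batch

    sql = "INSERT INTO changeset_sketch (id, user_id) VALUES (%s, %s)"
    total = 0
    with conn.cursor() as cur:
        for chunk in batched(rows, batch_size):
            # execute_batch groups the statements into fewer round trips
            # than calling cur.execute() once per row.
            execute_batch(cur, sql, chunk)
            total += len(chunk)
    conn.commit()
    return total
```

Holding only one batch in memory at a time keeps the footprint bounded while still cutting the number of server round trips.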
A changeset that only touches a node without moving it can have min_lat=max_lat and min_lon=max_lon. These are currently encoded as polygons, but zero-area polygons are very hard to make normally....
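One possible way to handle the degenerate case, sketched under assumptions: emit a point when the bounding box collapses to a single coordinate, and a polygon otherwise. This is not necessarily what the issue proposes; the function name `bbox_to_wkt` is hypothetical, and a box that collapses in only one dimension (a line) is not treated specially here.

```python
def bbox_to_wkt(min_lat, min_lon, max_lat, max_lon):
    """Return WKT for a changeset bounding box.

    WKT coordinates are written as "x y", i.e. longitude first.
    """
    if min_lat == max_lat and min_lon == max_lon:
        # Zero-area box: a single location, so encode it as a point.
        return f"POINT({min_lon} {min_lat})"

    # Closed ring: the first and last coordinates must be identical.
    ring = [
        (min_lon, min_lat),
        (max_lon, min_lat),
        (max_lon, max_lat),
        (min_lon, max_lat),
        (min_lon, min_lat),
    ]
    coords = ", ".join(f"{lon} {lat}" for lon, lat in ring)
    return f"POLYGON(({coords}))"
```

For example, `bbox_to_wkt(1.5, 2.5, 1.5, 2.5)` gives `POINT(2.5 1.5)`, while a box with distinct corners gives a five-vertex closed polygon.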
I'd like to add a user_name table for #19, and the easy way to do this would be an `INSERT ... ON CONFLICT ...` statement, but that requires PostgreSQL 9.5. Thoughts...
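To illustrate the shape of such an upsert, here is a small runnable sketch. It uses Python's built-in `sqlite3` as a stand-in so it runs without a server (this assumes the bundled SQLite is 3.24 or newer, which added upsert support). The column names `id` and `name` are assumptions. With psycopg2 against PostgreSQL the placeholders would be `%s` instead of `?`, and the clause would have the same form, `ON CONFLICT (id) DO UPDATE SET name = EXCLUDED.name`.

```python
import sqlite3

# Hypothetical schema: one row per user id, holding the latest seen name.
UPSERT = (
    "INSERT INTO user_name (id, name) VALUES (?, ?) "
    "ON CONFLICT (id) DO UPDATE SET name = excluded.name"
)


def record_users(conn, users):
    """Insert (id, name) pairs, overwriting the name if the id already exists."""
    conn.executemany(UPSERT, users)
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user_name (id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)

record_users(conn, [(1, "alice"), (2, "bob")])
# Seeing user 1 again under a new name updates the row instead of failing.
record_users(conn, [(1, "alice_renamed")])

rows = conn.execute("SELECT id, name FROM user_name ORDER BY id").fetchall()
```

After the second call, `rows` still holds two entries, with user 1 carrying the newer name.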
Some suggested changes, based on writing a lot of queries - Normalize user IDs so that the id -> name mapping lives in its own table. I do a lot of queries on...