Ian Dees
Ian Dees
I suppose it's not a requirement, but if we're publishing parquet it's because we expect it to be queried in some way. We would want to partition in order to...
> Is there a single zipped blob of GeoJSON available somewhere else? Nope that's the output. When I said "single zipped blob of GeoJSON" I meant the blob containing individual...
As an update here: I wrote a bit of Python code that converts all the GeoJSON into individual Parquet files and then concatenates them all into one, but I can't...
Here's a sample of OpenAddresses GeoJSON data for Hennepin county, MN converted to GeoParquet. [us_mn_hennepin.parquet.zip](https://github.com/user-attachments/files/15861219/us_mn_hennepin.parquet.zip) Converting the GeoJSON is pretty straightforward. Adding useful columns like e.g. source name and update...
Started a PR in https://github.com/openaddresses/batch/pull/394. Node's Parquet support is significantly less robust than Python's, so we'll have to see how this works.
Thanks for the suggestion! In situations like this, we usually build a script to do the work, document it thoroughly so we can run it in the future, and then...
Nice, looks like Canada!
The data is available at ```https://data.openaddresses.io/cache/uploads/iandees/760039/canada-nar-combined.csv.gz```
I think we had problems downloading from the Esri endpoint for the state for some reason (a slow database server behind it, if I remember right) and had to switch...
Thanks for collecting these, @justinelliotmeyers. I agree that these should get their own tickets to track progress more easily. I can try to do that this weekend but feel free...