Dominik Moritz
Dominik Moritz
Or we check in a test whether the arrow/parquet has the right values and types without going back to csv in the test.
Yeah, for now don't use stdin. Would love a pull request to fix this.
Pretty much, yes. The seekable reader we use across the libraries is in a shared crate in https://github.com/domoritz/arrow-tools/tree/main/crates/arrow-tools. We can move any shared functionality into that crate.
This seems to work now. ``` $ cat data/simple.csv | csv2arrow /dev/stdin -n Schema: { "fields": [ { "name": "a", "data_type": "Int64", "nullable": true, "dict_id": 0, "dict_is_ordered": false, "metadata": {}...
Hmm, that sounds like a bug. We should not try to infer the schema when it's provided.
I don't know. Could you do a git bisect to find out?
Awesome. I think the main things are in the GitHub issues. Better support for stdin and stdout and tests are the main things. Installing via homebrew would require some CI...
Can you try again to see whether these issues are still there?
Thanks, that seems to fix it. ``` arrow-tools/crates/json2arrow on main [!⇡1] via 🦀 v1.81.0 ❯ echo '{"a": 1, "b": 2}' | jq -c 'to_entries|.[]' | cargo run -- -n...
I'll close this for now since we have https://github.com/domoritz/arrow-tools/issues/14 which I think covers what's left here.