Samuel Colvin
Samuel Colvin
@messense do you think you will be able to work on this? Otherwise, there's a chance the pydantic team might be able to. @davidhewitt do you have and thoughts on...
@alamb if you're interested in JSON parsing support I might be interested in contributing. We (Pydantic) maintain a very fast Rust JSON parser (generally significantly faster thane serde-json), [jiter](https://github.com/pydantic/jiter) which...
tiny update to my example above, I realised there’s a much better comparison query: ```sql -- datafusion SELECT count(*) FROM records where json_contains(attributes, 'size'); -- 6165747 in 18.0s -- datafusion...
This is great news @alamb. Just to confirm, are you happy to use `jiter` and the approach I described above in [datafuison-functions-json](https://github.com/datafusion-contrib/datafuison-functions-json)? If so, I'm happy to help both implementing...
https://github.com/datafusion-contrib/datafusion-functions-json now provides the following methods, I think we're nearly ready for a first release, see https://github.com/datafusion-contrib/datafusion-functions-json/issues/5. * [x] `json_contains(json: str, *keys: str | int) -> bool` - true if...
@jayzhan211 that doesn't work since the argument types don't tell you what type will be returned. e.g.: * if the value in column `foo` is `{"x": "abc"}`, then `json_get(foo, 'x')`...
ye, `return_type_from_exprs` doesn't help. I got around it mostly, with I think good performance by rewriting the query when there's a cast, so: ```sql select * from foo where json_get(attributes,...
sorry for the slow reply, yes exactly.
I'm working on this, initially in arrow-rs.
As I mentioned here https://github.com/apache/datafusion/issues/7845#issuecomment-2073675288, the most useful thing for me would be less ugly errors when you do try to compare a union with something else. Also it would...