Gian Merlino

Results 161 comments of Gian Merlino

The doc in question is this page: https://druid.apache.org/docs/latest/querying/nested-columns.html

> This is not correct, at least for the HLL in datasketches-java (I'm not sure what the Druid adaptor does). Strings are encoded using UTF-8 and have been for as...

Pushed this stuff: - fixes for the issues @abhishekagarwal87 found - update to use the new API from https://github.com/apache/datasketches-java/pull/353 - improve test coverage for updateSketch and updateSketchWithDictionarySelector

I believe @AmatyaAvadhanula and @kfaraz were looking into this area of the code too, working to improve the Coordinator balancing behavior. Perhaps they will have some thoughts.

ForkingTaskRunner starts a new OS process and directs that process stdout and stderr to the `logFile`. We are relying on the fact that all output for an OS process goes...

> @gianm I was testing the MM-less patch on the msq work you did. I ran a test ingestion and the tasks just hang forever, after a bit of debugging...

@AmatyaAvadhanula there is a comment in TaskIdentifierMapper: ``` // If field is absent (older task version), use blank string to avoid a loop of migration of such tasks. ``` I...

I see, in that case my suspected explanation may not be what's actually happening.

Interesting. I am debugging another similar-sounding issue with TIME_FLOOR causing hung queries when run on inline data. It's related to materialization of excessive numbers of time buckets. I wonder if...