Gian Merlino

161 comments by Gian Merlino

It'd be better to make this work than to make this be an error. The way to do it is to avoid `Granularity.getIterable`, which is problematic when using non-ALL granularities...

Hmm. IMO, we should definitely change something, since the behavior of applying `FilenameUtils.wildcardMatch` to a path is just really weird. For example, this returns `true`:

```
FilenameUtils.wildcardMatch("a/b/c.txt", "a*.txt")
```

Which...
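For contrast, here is a minimal sketch of how a directory-aware glob treats the same inputs. It uses the JDK's `java.nio.file.PathMatcher`, which is not something the comment above proposes; it is only an assumption chosen for illustration, and `GlobDemo` and `globMatches` are hypothetical names. With this matcher a single `*` does not cross a `/` boundary, while `**` does.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;

class GlobDemo {
    // Hypothetical helper: true if `path` matches `glob` under the JDK's
    // "glob:" syntax, where "*" stays within one path segment and "**"
    // may span several segments.
    static boolean globMatches(String glob, String path) {
        PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:" + glob);
        return matcher.matches(Paths.get(path));
    }

    public static void main(String[] args) {
        // Same inputs as the wildcardMatch call quoted above.
        System.out.println(globMatches("a*.txt", "a/b/c.txt"));
        // A pattern that spells out the directories explicitly.
        System.out.println(globMatches("a/b/*.txt", "a/b/c.txt"));
        // "**" is allowed to cross directory boundaries.
        System.out.println(globMatches("a/**.txt", "a/b/c.txt"));
    }
}
```

The first call does not match, because `*` cannot consume the `/b/` portion of the path; the other two do. Whether this is the right semantics for a path filter is exactly the ease-of-use question being discussed.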

That's true, it's an ease-of-use vs. power tradeoff. I am legitimately worried that people will find it confusing to reason about whether, for example, prefix `s3://a/b` plus filter `b/*txt` applies...

> @gianm I would say it is quite common when people are massaging data using Spark into Iceberg.

@didip would you mind giving an example of how people would use...

I'm considering a few things about the design:

- We want the power of doing a whole-path glob. @didip describes scenarios where this additional power makes it more useful than...

I tried this locally and what happens here is:

1. The unsatisfiable filter leads to the table being swapped out with an inline empty recordset. This is good, it means...

I sketched out an idea for this issue. Here it is: https://github.com/apache/druid/compare/master...gianm:druid:scan-inline-sort. I haven't made a PR yet, since I'd need to add some tests. The idea in the patch...

Good question. That's another reason that little sketch isn't ready to be a PR 🙂 I think the answer is, if there's a limit then it's done by stableLimitingSort. If...

Thinking about it a bit more. I think it makes the most sense to do the sorting in the runner from createRunner. I actually think that is already happening: check...

Relevant Slack thread (link good for 90 days): https://apachedruidworkspace.slack.com/archives/C0303FDCZEZ/p1664492820183909

Original question:

> Looking forward to trying out nested columns
> Is this feature currently supported for streaming ingestion? It appears not...