Return all rows from Druid GroupBy query

Open ishan8 opened this issue 4 years ago • 3 comments

Is there a way to execute a Druid GroupBy query that returns all rows? It looks like there is currently a defaultMaximumMaxRows value that caps the result: https://github.com/yahoo/maha/blob/d6b48f24353d3a915c169286fbfc4fd80a1bd82b/core/src/main/scala/com/yahoo/maha/core/query/druid/DruidQueryGenerator.scala#L158

ishan8 avatar May 18 '20 17:05 ishan8

Last I checked, Druid had a limit on the number of rows that can be sent from a historical to the broker, and it didn't support pagination.

patelh avatar May 20 '20 06:05 patelh

You can pass in your own value; see the register method: https://github.com/yahoo/maha/blob/d6b48f24353d3a915c169286fbfc4fd80a1bd82b/core/src/main/scala/com/yahoo/maha/core/query/druid/DruidQueryGenerator.scala#L166
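
For illustration only, here is a minimal Python sketch of what overriding a default row cap could look like at the level of a Druid native groupBy query. It is not maha's API: `build_group_by_query`, `DEFAULT_MAX_ROWS`, the datasource, and the dimension name are all hypothetical, and it assumes the cap ends up as the `limit` of a `limitSpec` in the generated query.

```python
import json

# Hypothetical default cap for this sketch; not maha's actual value.
DEFAULT_MAX_ROWS = 5000


def build_group_by_query(data_source, dimensions, intervals, max_rows=DEFAULT_MAX_ROWS):
    """Build a Druid native groupBy query dict capped at max_rows rows."""
    if max_rows <= 0:
        raise ValueError("max_rows must be positive")
    return {
        "queryType": "groupBy",
        "dataSource": data_source,
        "granularity": "all",
        "dimensions": list(dimensions),
        "aggregations": [{"type": "count", "name": "rows"}],
        "intervals": list(intervals),
        # The row cap is expressed through the default limitSpec.
        "limitSpec": {"type": "default", "limit": max_rows, "columns": []},
    }


if __name__ == "__main__":
    # Pass a larger cap instead of relying on the default.
    query = build_group_by_query(
        "example_ds", ["campaign_id"], ["2020-05-01/2020-05-18"], max_rows=100000
    )
    print(json.dumps(query, indent=2))
```

Even with a larger cap, the Druid-side limits mentioned above still apply, so a very high value may not be enough on its own to get every row back.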

patelh avatar May 20 '20 06:05 patelh

If you are returning a large number of rows, you might want to use OffHeapRowList: https://github.com/yahoo/maha/issues/675

pranavbhole avatar Jun 27 '20 00:06 pranavbhole