maha
maha copied to clipboard
Return all rows from Druid GroupBy query
Is there a way to execute a Druid GroupBy query where we get back all rows? Looks like there's currently a defaultMaximumMaxRows
value. https://github.com/yahoo/maha/blob/d6b48f24353d3a915c169286fbfc4fd80a1bd82b/core/src/main/scala/com/yahoo/maha/core/query/druid/DruidQueryGenerator.scala#L158
Last I checked Druid had a limit on the number of rows that can be sent from historical to broker and it didn't support pagination.
You can pass in your own value, see register method: https://github.com/yahoo/maha/blob/d6b48f24353d3a915c169286fbfc4fd80a1bd82b/core/src/main/scala/com/yahoo/maha/core/query/druid/DruidQueryGenerator.scala#L166
in the case of returning large rows, you might want to use OffHeapRowList https://github.com/yahoo/maha/issues/675