druid
druid copied to clipboard
Apache Druid: a high performance real-time analytics database.
The [Druid catalog](https://github.com/apache/druid/issues/12546) provides a collection of metadata "hints" about tables (datasources, input sources, views, etc.) within Druid. This PR provides the foundation: the DB and REST layer, but not...
If Kafka is configured with SASL / SCRAM, then Druid, which is the Kafka consumer, should pass the SASL / SCRAM credentials in the consumerProperties block of ioConfig section of...
Using the default settings provided by singler-server/medium, the coordinator-overlord log is filled with warnings such as: 2020-03-03T14:00:07,794 WARN [Coordinator-Exec--0] org.apache.druid.server.coordinator.rules.LoadRule - No available [_default_tier] servers or node capacity to assign...
Fixes multiple items in #12881 ### Description This PR lays the ground work to allow load queue to safely have an unlimited number of items, and thus eventually phase out...
Please provide a detailed title (e.g. "Broker crashes when using TopN query with Bound filter" instead of just "Broker crashes"). ### Affected Version Tested on latest master branch 25.0.0-SNAPSHOT ###...
- update Apache Ranger to the latest version 2.3.0 - override gethostname4j dependency of Apache Ranger to use 1.0.0 and JNA > 5.7.x to fix failing unit tests on macOS...
### Description In MSQ, there can be an upper limit to the number of worker warnings. For example, for parseExceptions encountered while parsing the external data, the user can specify...
### Description In the current version of MSQ, workers are bound to a specific task. This is not cool for a fault tolerant engine where worker tasks can be ephemeral,...
### Current Implementation In the current MSQ implementation, each worker maintains a ClusterByStatisticsCollector. If we are clustering by time, one sketch is maintained for each time chunk. During the merge,...