drill
drill copied to clipboard
Apache Drill is a distributed MPP query layer for self describing data
Unifying test class hierarchy and adding randomization functionality.
@parthchandra please review (just the third commit).
Change 'project.artifactId' from 'drill-root' to 'apache-drill' _Note: it includes changes for #1743 also (it is expected that PR will be merged first)_
This PR contains changes for the support of RM Framework both on execution and planning side, tracked by JIRA's DRILL-7191 and DRILL-7026. 1) Refactoring existing ZK based queue to accommodate...
The new option is passed to the TimedCallable code via the existing ParquetReaderConfig parameter, by adding a Long field there - timeoutPerRunnableInMsec . The timeout (TIMEOUT_PER_RUNNABLE_IN_MSECS) is used there for...
Currently, updating the version information for a Drill release involves updating 30+ pom files. The right way would be to use the Multi Module Setup for Maven CI. https://maven.apache.org/maven-ci-friendly.html#Multi_Module_Setup
This PR contains changes for implementing batch sizing in SingleSender. The sizing is done by using the RecordBatchSizer.
DRILL-6845: Semi-Hash-Join to skip incoming build duplicates, automatically stop skipping if too few
The first two commits here were extracted from the original PR #1522 (DRILL-6735), where the Semi-Hash-Join was implemented in a straightforward way: Read data like a regular hash join (e.g....
# Goal The goal of this change is to move code for managing a partition in HashAgg into a seperate class, similar to the [HashPartition](https://github.com/apache/drill/blob/master/exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/common/HashPartition.java) class in HashJoinBatch. This has...
Fixed several Issues due to Schema changes: 1) Changes in complex data types Drill Query Failing when selecting all columns from a Complex Nested Data File (Parquet) Set). There are...