datafusion-ballista
datafusion-ballista copied to clipboard
Apache Arrow Ballista Distributed Query Engine
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** In current code, each poll work request will scan all waiting task...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** We need to check whether the states saved in the sled is...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** Let's say we've an organization that deploys ballista cluster to which multiple...
**Describe the bug** Start ballista ui with docker, but it can not found ballista scheduler **To Reproduce** Steps to reproduce the behavior: I clone the core from master branch. And...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** We are seeing PRs merged that cause regressions in Ballista because we...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** PR https://github.com/apache/arrow-datafusion/pull/320 implemented hash partitioned aggregation in DataFusion. We should implement the...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** I have a Ballista cluster running, and each scheduler and executor has...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** As mentioned in https://github.com/apache/arrow-datafusion/pull/543#discussion_r650883972 we could optimize for the case where there...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** By default, DataFusion uses hash-partitioned joins if concurrency > 1 which led...
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** There is currently no way to specify configuration parameters such as default...