Apache Spark - A unified analytics engine for large-scale data processing

Results 649 spark issues

### What changes were proposed in this pull request? This pull request adds the following metrics to JSON file scan nodes to collect metrics related to variants being constructed as...

SQL

### What changes were proposed in this pull request? When any predicate is missing in `getPartitionsByFilter` and it tries to fetch all the partitions, the request is broken into...

SQL

### What changes were proposed in this pull request? In the PR, I propose to remove the constructors of `SparkException` that accept a plan string as an error message, and...

SQL
STRUCTURED STREAMING
BUILD
CORE
PYTHON
CONNECT

### What changes were proposed in this pull request? In the PR, I propose to rename the `errorClass` parameter to `condition` in `JdbcDialect.classifyException` and in `JdbcUtils.classifyException`. ### Why are the...

SQL

### What changes were proposed in this pull request? Call `executeCollect` when the DataFrame is `CollectLimitExec` or `CollectTailExec`. ### Why are the changes needed? For limit execution, `SparkPlan.executeCollect` is more...

SQL
CONNECT
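One general reason a limit can be collected cheaply is that only the first `n` rows are needed, so a collector can stop reading partitions as soon as it has enough. The block below is a minimal Python sketch of that idea, not Spark code: `collect_limit` and the list-of-callables partition model are hypothetical names introduced purely for illustration.

```python
from typing import Callable, Iterable, List, Tuple


def collect_limit(
    partitions: List[Callable[[], Iterable[int]]], n: int
) -> Tuple[List[int], int]:
    """Return the first n rows and the number of partitions actually read.

    Hypothetical model: each partition is a zero-argument callable that
    yields its rows only when invoked.
    """
    rows: List[int] = []
    scanned = 0
    for load in partitions:
        if len(rows) >= n:
            break  # enough rows collected, skip the remaining partitions
        scanned += 1
        for row in load():
            rows.append(row)
            if len(rows) >= n:
                break
    return rows, scanned


parts = [lambda: [1, 2, 3], lambda: [4, 5, 6], lambda: [7, 8, 9]]
rows, scanned = collect_limit(parts, 4)
```

In this toy model the third partition is never read, because the limit is satisfied partway through the second one.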

### What changes were proposed in this pull request? This PR is intended to improve analyzer performance by skipping the DeduplicateRelations rule when there is a guarantee that there...

SQL
STRUCTURED STREAMING
PYTHON
CONNECT

### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ###...

SQL
STRUCTURED STREAMING

### What changes were proposed in this pull request? In `jsonObjectKeys`, we should call `skipChildren()` properly for nested objects. ### Why are the changes needed? Currently if `jsonParser.currentName()` is null,...

SQL
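To illustrate why skipping children matters when collecting the keys of a JSON object from a token stream, here is a minimal Python sketch. It is not the Spark or Jackson implementation: the token tuples, `top_level_keys`, and `skip_children` are hypothetical stand-ins that only mimic the shape of a streaming parser.

```python
from typing import Iterator, List, Optional, Tuple

Token = Tuple[str, Optional[object]]
STARTS = ("START_OBJECT", "START_ARRAY")
ENDS = ("END_OBJECT", "END_ARRAY")


def skip_children(tokens: Iterator[Token]) -> None:
    """Consume tokens up to the end of the container that was just opened."""
    depth = 1
    for kind, _ in tokens:
        if kind in STARTS:
            depth += 1
        elif kind in ENDS:
            depth -= 1
            if depth == 0:
                return


def top_level_keys(tokens: List[Token]) -> Optional[List[str]]:
    """Collect only the outermost object's field names."""
    it = iter(tokens)
    kind, _ = next(it)
    if kind != "START_OBJECT":
        return None  # the input is not a JSON object
    keys: List[str] = []
    for kind, value in it:
        if kind == "END_OBJECT":
            break
        if kind == "FIELD_NAME":
            keys.append(str(value))
            value_kind, _ = next(it)  # advance to the field's value
            if value_kind in STARTS:
                skip_children(it)  # do not descend into nested fields
    return keys


# Token stream for {"a": {"b": 1}, "c": [{"d": 2}]}
stream: List[Token] = [
    ("START_OBJECT", None),
    ("FIELD_NAME", "a"), ("START_OBJECT", None),
    ("FIELD_NAME", "b"), ("VALUE", 1), ("END_OBJECT", None),
    ("FIELD_NAME", "c"), ("START_ARRAY", None), ("START_OBJECT", None),
    ("FIELD_NAME", "d"), ("VALUE", 2), ("END_OBJECT", None),
    ("END_ARRAY", None),
    ("END_OBJECT", None),
]
keys = top_level_keys(stream)
```

Without the `skip_children` call, the nested names `b` and `d` would be picked up as if they were top-level keys.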

### What changes were proposed in this pull request? This PR adds support for users to provide a DataFrame that can be used to instantiate state for the query in...

SQL
STRUCTURED STREAMING
DOCS
CORE
PYTHON

### What changes were proposed in this pull request? This PR changes the message of the `ARITHMETIC_OVERFLOW` error and adds relevant try expressions: - `try_make_interval` - `try_conv` - `try_round` - `try_bround`...

SQL
DOCS
PYTHON
CONNECT
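The general pattern behind `try_`-prefixed expressions is to return a null value where the strict variant would raise an error. The block below is a pure-Python sketch of that pattern under stated assumptions: `checked_add` and `try_call` are hypothetical helpers, the 32-bit range is chosen only for illustration, and none of this reflects how Spark implements the expressions listed above.

```python
from typing import Callable, Optional

INT_MIN, INT_MAX = -2**31, 2**31 - 1  # illustrative 32-bit integer range


def checked_add(a: int, b: int) -> int:
    """Strict variant: raise when the result leaves the 32-bit range."""
    result = a + b
    if not INT_MIN <= result <= INT_MAX:
        raise ArithmeticError("ARITHMETIC_OVERFLOW: integer overflow")
    return result


def try_call(fn: Callable[..., int], *args: int) -> Optional[int]:
    """Lenient variant: map an arithmetic error to None (a stand-in for NULL)."""
    try:
        return fn(*args)
    except ArithmeticError:
        return None


ok = try_call(checked_add, 1, 2)
overflowed = try_call(checked_add, INT_MAX, 1)
```

Here `ok` holds the ordinary sum, while `overflowed` is `None` rather than an exception, which is the behavior an error message can point users toward.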