Apache Spark - A unified analytics engine for large-scale data processing

Results 649 spark issues

### What changes were proposed in this pull request? This PR changes the declared type of `TaskMetrics#externalAccums` from `s.c.mutable.ArrayBuffer` to `j.u.concurrent.CopyOnWriteArrayList` to ensure that errors described in SPARK-39696 (`java.util.ConcurrentModificationException: mutation occurred...

SQL
WEB UI
CORE
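
The failure mode named in the description above can be illustrated outside Spark. The following is a minimal sketch in plain Java, not code from the PR: the class `AccumulatorListDemo` and the method `iterateWhileMutating` are hypothetical names, and `java.util.ArrayList` stands in for the Scala `ArrayBuffer` as an assumption, since both fail fast when mutated during iteration. It contrasts that behavior with `CopyOnWriteArrayList`, whose iterators read a snapshot of the backing array.

```java
import java.util.ArrayList;
import java.util.ConcurrentModificationException;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Hypothetical demo class; not part of Spark.
public class AccumulatorListDemo {

    // Iterates over the list and appends one element while the iteration
    // is still in progress. Returns true if the loop finished without a
    // ConcurrentModificationException, false otherwise.
    static boolean iterateWhileMutating(List<String> accums) {
        try {
            for (String name : accums) {
                if (name.equals("a")) {
                    accums.add("late"); // mutation during iteration
                }
            }
            return true;
        } catch (ConcurrentModificationException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // ArrayList is a stand-in (assumption) for a fail-fast mutable buffer.
        List<String> plain = new ArrayList<>(List.of("a", "b"));
        // CopyOnWriteArrayList iterators work on a snapshot of the array.
        List<String> cow = new CopyOnWriteArrayList<>(List.of("a", "b"));

        System.out.println("ArrayList survived: " + iterateWhileMutating(plain));
        System.out.println("CopyOnWriteArrayList survived: " + iterateWhileMutating(cow));
    }
}
```

The trade-off is that every write to a `CopyOnWriteArrayList` copies the backing array, so it tends to suit lists that are read or iterated far more often than they are modified.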

…tion scope ### What changes were proposed in this pull request? Updated some `spark.io.compression` configuration descriptions to clarify parameter application scope. ### Why are the changes needed? Users are easily...

DOCS

### What changes were proposed in this pull request? Add config to make DEFAULT values in JSON tables mutually exclusive with SQLConf.JSON_GENERATOR_IGNORE_NULL_FIELDS. When this new config is true, allow DEFAULT...

SQL

### What changes were proposed in this pull request? Followup for https://github.com/apache/spark/pull/37294, to improve test coverage by adding more tests. ### Why are the changes needed? To improve the test...

CORE
PYTHON
PANDAS API ON SPARK

### What changes were proposed in this pull request? Add config to toggle whether to automatically add default values for INSERTs without user-specified fields. Example: ``` CREATE TABLE t (a...

SQL

### What changes were proposed in this pull request? Enable implicit DEFAULT column values in inserts from DataFrames. This mostly already worked since the DataFrame inserts already converted to LogicalPlans....

SQL

Currently, matching the Parquet schema by ID fails in certain cases. No new unit test was added. ### What changes were proposed in this pull request? In this PR, fixed cases where...

SQL

### What changes were proposed in this pull request? This adds UNPIVOT clause to SQL syntax. It follows the same syntax as [BigQuery](https://cloud.google.com/bigquery/docs/reference/standard-sql/query-syntax#unpivot_operator), [T-SQL](https://docs.microsoft.com/en-us/sql/t-sql/queries/from-using-pivot-and-unpivot?view=sql-server-ver15#unpivot-example), [Oracle](https://www.oracletutorial.com/oracle-basics/oracle-unpivot/): ``` FROM ... [ unpivot_clause...

SQL
MLLIB
KUBERNETES
GRAPHX
MESOS
BUILD
SPARK SHELL
YARN
EXAMPLES
DOCS
CORE
INFRA
PYTHON
DSTREAM

### What changes were proposed in this pull request? AQE uses statistics from completed query stages and feeds them back into the logical optimizer. AQE currently only uses `dataSize` and...

SQL

### What changes were proposed in this pull request? The current implementation of the `replaceWithAliases` method in `NestedColumnAliasing` replaces the children of the current plan with mapped aliases. When the...

SQL