spark issues

[SPARK-47193][SQL] Ensure SQL conf is propagated to executors when actions are called on RDD returned by `Dataset#rdd`

### What changes were proposed in this pull request? This change wraps the iterator returned by `SQLExecutionRDD#compute` so that it propagates the SQL conf at the time the iterator is...

bersprockets

SQL

[SPARK-49938][K8S] Prefer to use Hadoop configMap when specified

1

### What changes were proposed in this pull request? Currently, 'HADOOP_CONF_DIR' in ENV and Hadoop configmap cannot be both configured. However, in cloud vendor EMR environments, 'HADOOP_CONF_DIR' is often already...

zgzzbws

KUBERNETES

[WIP][SPARK-49816][SQL][FOLLOW-UP] Fix conflicting CTE ids

1

### What changes were proposed in this pull request? This is a follow-up PR that reverts https://github.com/apache/spark/pull/48284 in the first commit and offers a new way to deal with the...

peter-toth

SQL

[MINOR] Fix code style in RuleIdCollection.scala

### What changes were proposed in this pull request? Find a code style issue and fix it. ### Why are the changes needed? Fix code style ### Does this PR...

exmy

SQL

[DO-NOT-REVIEW][DRAFT] integration3

### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ###...

WweiL

SQL

STRUCTURED STREAMING

BUILD

[SPARK-48922][SQL] Optimize nested data type insertion performance

3

### What changes were proposed in this pull request? To improve insertion performance, we do not need to add transform expressions when there is no conversion for complex types. ###...

wForget

SQL

[SPARK-49547][SQL][PYTHON] Support returning iterator of RecordBatches in applyInArrow

14

### What changes were proposed in this pull request? Add the option to `applyInArrow` to take a function that takes an iterator of `RecordBatch` and returns an iterator of `RecordBatch`....

Kimahriman

SQL

CORE

PYTHON

[SPARK-49711][SQL] Remove ExperimentalMethods

1

### What changes were proposed in this pull request? This PR removes ExperimentalMethod from SQL. This is the first extension point we had for Spark SQL. However it is has...

hvanhovell

SQL

STRUCTURED STREAMING

BUILD

[SPARK-49921][CORE] Add task write data time to SQL tab's graph node

### What changes were proposed in this pull request? Add task write data time to SQL tab's graph node. After adding the metric, the following figure is shown. ### Why...

huangxiaopingRD

SQL

[SPARK-49919][SQL] Add special limits support for return content as JSON dataset

1

### What changes were proposed in this pull request? `CollectLimitExec` is used when a logical `Limit` and/or `Offset` operation is the final operator. Comparing to `GlobalLimitExec`, it can avoid shuffle...

LantaoJin

SQL

spark
spark copied to clipboard

Metadata

[SPARK-47193][SQL] Ensure SQL conf is propagated to executors when actions are called on RDD returned by `Dataset#rdd`

[SPARK-49938][K8S] Prefer to use Hadoop configMap when specified

[WIP][SPARK-49816][SQL][FOLLOW-UP] Fix conflicting CTE ids

[MINOR] Fix code style in RuleIdCollection.scala

[DO-NOT-REVIEW][DRAFT] integration3

[SPARK-48922][SQL] Optimize nested data type insertion performance

[SPARK-49547][SQL][PYTHON] Support returning iterator of RecordBatches in applyInArrow

[SPARK-49711][SQL] Remove ExperimentalMethods

[SPARK-49921][CORE] Add task write data time to SQL tab's graph node

[SPARK-49919][SQL] Add special limits support for return content as JSON dataset

← Metadata

Owner

Metadata

spark spark copied to clipboard

Metadata

← Metadata

Owner

Metadata

spark
spark copied to clipboard