spark icon indicating copy to clipboard operation
spark copied to clipboard

[SPARK-49940][CONNECT]Make 'limit n' in spark connect use executeCollect

Open xieshuaihu opened this issue 4 months ago • 0 comments

What changes were proposed in this pull request?

Call executeCollect when the DataFrame is CollectLimitExec or CollectTailExec.

Why are the changes needed?

For limit execution, SparkPlan.executeCollect is more performant than SparkPlan.execute.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Existing tests

Was this patch authored or co-authored using generative AI tooling?

No

xieshuaihu avatar Oct 12 '24 09:10 xieshuaihu