Wenchen Fan comments

Results 245 comments of


                                            Wenchen Fan

[SPARK-47050][SQL] Collect and publish partition level metrics

Can we split this PR into two? IIUC the DS v1 change can benefit file source tables immediately if `spark.sql.statistics.size.autoUpdate.enabled` is enabled. For the DS v2 part, do we support...

[SPARK-46707][SQL][FOLLOWUP] Push down throwable predicate through aggregates

thanks, merging to master!

[SPARK-42199][SQL] Fix issues around Dataset.groupByKey

@EnricoMi sorry this PR is lost track. Have you addressed all the review comments?

[SPARK-46937][SQL] Improve concurrency performance for FunctionRegistry

Shall we revert this if https://github.com/apache/spark/pull/44976#discussion_r1630428579 is a real issue? I don't think this is a critical path for performance (how much parallelism do you expect for function lookups in...

[SPARK-46937][SQL] Improve concurrency performance for FunctionRegistry

I've sent out the revert PR: https://github.com/apache/spark/pull/46940

[Spark] Fix time option evaluation

cc @scottsand-db

[BUG][SPARK] listTables() fails after createOrReplaceTempView('abc') called with PARSE_SYNTAX_ERROR

@felipepessoto thanks for providing the repro! What was the error you hit? And can you also post the result of `spark.sessionState.executePlan(plan).analyzed.treeString`?

[BUG][SPARK] listTables() fails after createOrReplaceTempView('abc') called with PARSE_SYNTAX_ERROR

one workaround is to set `spark.sql.legacy.useV1Command` to true. Ideally `DeltaCatalog` should not return views in `listTables`.

Add forceSchema option to output to specified schema

If the spark schema doesn't match the specified avro schema, what shall we do? And shall we allow compatible schema changing like int to long?

Add forceSchema option to output to specified schema

As a start, I think we can simply require the spark schema to be same as avro schema, while accepting namespace/field name difference.