Robert (Bobby) Evans
Another odd example of this is +INF and -INF. Even if `allowNonNumericNumbers` is disabled, +INF and -INF are valid floats and are normalized to "Infinity" and "-Infinity", respectively. And the...
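For reference, here is a minimal repro sketch, assuming a local SparkSession; the schema and input rows are made up for illustration, and the exact behavior can shift between Spark versions (see the follow-up below):

```
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types._

val spark = SparkSession.builder().master("local[*]").getOrCreate()
import spark.implicits._

// Quoted "+INF"/"-INF" values are still accepted as floats even though
// allowNonNumericNumbers is off, and they come back as Infinity/-Infinity.
val schema = StructType(Seq(StructField("f", FloatType)))
val ds = Seq("""{"f": "+INF"}""", """{"f": "-INF"}""").toDS()
spark.read
  .option("allowNonNumericNumbers", "false")
  .schema(schema)
  .json(ds)
  .show()
```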
Technically in Spark 4.0 this was reverted (at least for scan, by default): https://issues.apache.org/jira/browse/SPARK-48148 https://github.com/apache/spark/pull/46408. This functionality was put under a config, `spark.sql.json.enableExactStringParsing`, which is on by default. It appears...
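If you want to experiment with it, something like this should work (a sketch assuming Spark 4.0 defaults; the sample row and schema are made up for illustration):

```
import org.apache.spark.sql.types._
import spark.implicits._

// On by default in 4.0: a nested object read back as STRING keeps its exact
// input text instead of being parsed and re-serialized (which could, for
// example, rewrite 1.00 as 1.0).
spark.conf.set("spark.sql.json.enableExactStringParsing", "true")

val schema = StructType(Seq(StructField("a", StringType)))
val ds = Seq("""{"a": {"b": 1.00}}""").toDS()
spark.read.schema(schema).json(ds).show(truncate = false)
// Flip the config to "false" to get the older normalizing behavior back.
```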
I was able to get the code to fall back, but I don't think that it matters at all.
```
val df1 = spark.read.parquet("/data/tpcds/SF200_parquet_decimal/store_sales/")
  .select("ss_sold_date_sk", "ss_sold_time_sk")
  .filter("ss_sold_date_sk = 0")
val df2 = spark.read.parquet("/data/tpcds/SF200_parquet_decimal/store_sales/")
  .selectExpr("ss_sold_date_sk", ...
```
@liurenjie1024 why do we need to force it to be per-file? Is it because we don't want to merge files, so we can insert a count for the number...
https://github.com/rapidsai/cudf/pull/13373#issuecomment-2168287320
@binmahone sorry for the late reply. No, it is not exactly the same as what happens today with `tryMergeAggregatedBatches`. `tryMergeAggregatedBatches` happens after an initial aggregation pass through all of the...
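To make the distinction concrete, here is a hedged sketch of that existing flow (not the real spark-rapids code; `Batch`, `aggregate`, `merge`, and `fitsInTarget` are hypothetical stand-ins):

```
// Hypothetical stand-ins for illustration only.
case class Batch(rows: Long, bytes: Long)

def aggregate(b: Batch): Batch = b                 // placeholder per-batch agg
def merge(a: Batch, b: Batch): Batch =             // placeholder concatenation
  Batch(a.rows + b.rows, a.bytes + b.bytes)
def fitsInTarget(a: Batch, b: Batch): Boolean =
  a.bytes + b.bytes <= (1L << 30)                  // placeholder size target

def aggregateAll(input: Iterator[Batch]): Seq[Batch] = {
  // Pass 1: run the aggregation over every input batch first.
  val aggregated = input.map(aggregate).toList
  // Pass 2: only after all of the input has been aggregated does the
  // tryMergeAggregatedBatches-style merge kick in, combining neighboring
  // aggregated batches that still fit in the target batch size.
  aggregated.foldLeft(List.empty[Batch]) {
    case (head :: tail, next) if fitsInTarget(head, next) =>
      aggregate(merge(head, next)) :: tail
    case (acc, next) => next :: acc
  }.reverse
}
```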
We probably also want to come up with a set of standardized benchmarks to cover this use case, as NDS does not cover it well. https://github.com/NVIDIA/spark-rapids/pull/11376#issuecomment-2400253511 is a comment I...