Robert (Bobby) Evans comments

Results 204 comments of


                                            Robert (Bobby) Evans

Add in config to avoid AST in some join cases

> Would it make sense to have RapidsMeta have the ability for expressions to be disabled for AST individually if spark.rapids.sql.expression.ast._classname_ is false? We probably still want an overall AST...

[BUG] Integration test `test_re_replace_all` fails with a corner case

It looks like this might be related to how Spark/Python interprets the string 'a\x85'. ``` import pyspark.sql.types df = spark.createDataFrame(SparkContext.getOrCreate().parallelize([("a\x85")]), pyspark.sql.types.StringType()) spark.conf.set("spark.rapids.sql.enabled", False) df.selectExpr('CAST(value as BINARY)').show() +----------+ | value| +----------+...

Robert (Bobby) Evans

Add in config to avoid AST in some join cases

[BUG] Integration test `test_re_replace_all` fails with a corner case

[FEA] Chunked ORC reading

Change the java API so a global default host allocator can be set.

[BUG] double free or memory corruption when parsing some JSON

JSON reader validation of values

[BUG] JSON white space normalization removes too much for unquoted values

[FEA] Improve performance of high-multiplicity joins

[FEA] Improve performance of high-multiplicity joins

Figure out why `MapFromArrays ` appears in the tests for hive parquet write