Manoj Kumar
Manoj Kumar
FYI .. @PHILO-HE
I am trying above with https://github.com/oap-project/gazelle_plugin/releases/download/v1.4.0/gazelle-plugin-1.4.0-spark-3.2.1.jar , its getting crashed > please let me know your SQL statement for reproducing. even a simple select statement is crashing As per your...
@PHILO-HE @weiting-chen I tried conda env but its also not work out.  Machine Info OS: Ubuntu 18.04.6 LTS x86_64 CPU: Intel i5-8400 (6) @ 4.000GHz Finally I git clone...
> @Manoj-red-hat Could you help to check if the issue is happened "during" the query or "after" the query. I would like to identify if this is a known issue...
@weiting-chen sorry for late reply got stuck in some-other work, anyways thanks for your support and indeed its a great project. Now I am exploring gazelle on tpcds side >...
FYI .. @weiting-chen @PHILO-HE
@zhouyuan I am not using Partitioned TPCH dataset And after more debugging it looks parquet --> arrow is pretty slow @attaching plans for your reference, see scan times [vanilla.pdf](https://github.com/oap-project/gazelle_plugin/files/8811117/vanilla.pdf) [arrow_q1.pdf](https://github.com/oap-project/gazelle_plugin/files/8811119/arrow_q1.pdf)
When I disable WSCG set spark.sql.codegen.wholeStage=false; Query works fine
I resolved this issue by using Arrow Data Source. Here is my observation 1) if Arrow data source is not enabled, please disable set spark.sql.codegen.wholeStage=false; (spark-sql) in order to run...
Hi @PHILO-HE , Ya I understand you guys are working hard on this, evident from daily commits. > Is there any reason for not using arrow data source on your...