Manoj Kumar comments

Results 23 comments of


                                            Manoj Kumar

I am trying above with https://github.com/oap-project/gazelle_plugin/releases/download/v1.4.0/gazelle-plugin-1.4.0-spark-3.2.1.jar , its getting crashed > please let me know your SQL statement for reproducing. even a simple select statement is crashing As per your...

[V1.4.0] gazelle plugin crash

@PHILO-HE @weiting-chen I tried conda env but its also not work out. ![image](https://user-images.githubusercontent.com/5130287/180999830-629bb202-a77f-4ba0-ae42-a05ee1f96b6d.png) Machine Info OS: Ubuntu 18.04.6 LTS x86_64 CPU: Intel i5-8400 (6) @ 4.000GHz Finally I git clone...

[V1.4.0] gazelle plugin crash

> @Manoj-red-hat Could you help to check if the issue is happened "during" the query or "after" the query. I would like to identify if this is a known issue...

[V1.4.0] gazelle plugin crash

@weiting-chen sorry for late reply got stuck in some-other work, anyways thanks for your support and indeed its a great project. Now I am exploring gazelle on tpcds side >...

[300G TPCH Benchmark] Analysis of 1.3.1 on single node

FYI .. @weiting-chen @PHILO-HE

[300G TPCH Benchmark] Analysis of 1.3.1 on single node

@zhouyuan I am not using Partitioned TPCH dataset And after more debugging it looks parquet --> arrow is pretty slow @attaching plans for your reference, see scan times [vanilla.pdf](https://github.com/oap-project/gazelle_plugin/files/8811117/vanilla.pdf) [arrow_q1.pdf](https://github.com/oap-project/gazelle_plugin/files/8811119/arrow_q1.pdf)

java.lang.ClassCastException: org.apache.spark.sql.execution.vectorized.OnHeapColumnVector cannot be cast to com.intel.oap.vectorized.ArrowWritableColumnVector

When I disable WSCG set spark.sql.codegen.wholeStage=false; Query works fine

java.lang.ClassCastException: org.apache.spark.sql.execution.vectorized.OnHeapColumnVector cannot be cast to com.intel.oap.vectorized.ArrowWritableColumnVector

I resolved this issue by using Arrow Data Source. Here is my observation 1) if Arrow data source is not enabled, please disable set spark.sql.codegen.wholeStage=false; (spark-sql) in order to run...

java.lang.ClassCastException: org.apache.spark.sql.execution.vectorized.OnHeapColumnVector cannot be cast to com.intel.oap.vectorized.ArrowWritableColumnVector

Hi @PHILO-HE , Ya I understand you guys are working hard on this, evident from daily commits. > Is there any reason for not using arrow data source on your...