hive-testbench
hive-testbench copied to clipboard
hive-testbench doesn't generate database
Hello,
I'm using hive-testbench-hdp3 on HDP3.1.4. A bunch of files were generated in target HDFS directory after running tpcds-setup.sh. However no database was generated. Any advice to address the issue?
My steps are like:
-
run tpcds-build.sh
-
run 'FORMAT=parquet ./tpcds-setup.sh 10 /benchmarks/tpcds'. Note it reported an error as below
TPC-DS text data generation complete. Loading text data into external tables. make: *** [date_dim] Error 1 Loading constraints Data loaded into database tpcds_bin_partitioned_parquet_10.
- check databses in hive - there is no new databse generated.
Most likely beeline command needs to be modified in line no 73 of tpcds-setup.sh as per your setup. Add DEBUG_SCRIPT=ON to get verbose logging, that should help you to resolve your issue.
Thanks for the advice. I ran the commands in tpcds-setup.sh manually. It seems the issue is about "make -j 1 -f $LOAD_FILE". This line returns error as below. Do you know what the problem is about?
No implicit rule found for all'. Considering target file
date_dim'.
File date_dim' does not exist. Finished prerequisites of target file
date_dim'.
Must remake target date_dim'. Invoking recipe from load_parquet_2.mk:3 to update target
date_dim'.
Putting child 0x24cc530 (date_dim) PID 21140 on the chain.
Live child 0x24cc530 (date_dim) PID 21140
Reaping losing child 0x24cc530 PID 21140
make: *** [date_dim] Error 1
Removing child 0x24cc530 PID 21140 from chain.
Finally I got all commands run successfully manually. Thanks.
Hi @vivian5-git - how did you resolve the issue? I am facing similar issue.
Finally I got all commands run successfully manually. Thanks.
Where did you modify it
how did you resolve the issue? I am facing similar issue.
About this issue, I think you can refer to: https://community.cloudera.com/t5/Support-Questions/Not-able-to-run-hive-benchmark-test/m-p/137781
The process of loading is text file table->orc table. The default table type (hive.default.fileformat) of my hive environment is orc, then the text file table ddl may become orc type by mistake, and then the same error will occur. This problem can be solved by changing the default type setting in /settings/load-flat.sql:
Set hive.default.fileformat=TextFile
you can find the error message in hive server2 log.