hive-testbench
hive-testbench copied to clipboard
Testbench for experimenting with Apache Hive at any data scale.
For example, tables `nation` and `region` are generated not once, but the number of the scale factor. e.g. at 300, there is 300x the data generated for each. See commit...
In your blog post [Benchmarking Apache Hive 13 for Enterprise Hadoop](http://hortonworks.com/blog/benchmarking-apache-hive-13-enterprise-hadoop/) you site this repo as the souce, but Hive 0.10 requires ANSI SQL-92 join syntax and the `hive13` branch...
Can you provide some clarity about what branch should be used for what version of Hive/Tez? The `hive13` branch README.md references `testbench.settings` but the file is `init.settings` in that branch....
hive-testbench is not Hive 0.13.1 compatible as it stands in trunk. See http://stackoverflow.com/questions/24316492/unable-to-configure-hive-exec-hooks-due-to-missing-jar for more info. What is the purpose of these settings, and are they a requirement?
Running tpcds_build.sh gives: In file included from w_store_returns.c:40: ./w_store_sales.h:36:9: warning: 'W_STORE_SALES_H' is used as a header guard here, followed by #define of a different macro [-Wheader-guard] # ifndef W_STORE_SALES_H ```...