tpcds-kit icon indicating copy to clipboard operation
tpcds-kit copied to clipboard

Re-enable printing to stdout

Open ianbuss opened this issue 7 years ago • 2 comments

ianbuss avatar Apr 07 '17 13:04 ianbuss

It is more complicated than this to get this working correctly. The challenge is that the sales/returns tables are generated in pairs and there is no way to generate only one of them. I've recently been using https://github.com/teradata/tpcds because it is much faster than the TPC version written in C (surprising, I know).

gregrahn avatar Apr 10 '17 14:04 gregrahn

Yes, I resorted to the rather ugly approach of relying on the fact that child tables all have a different number of fields for now, which is just nasty. Going straight from dsdgen to Parquet using Spark which takes away the requirement for passwordless SSH for distribution etc, but I will definitely check out the link, thanks.

ianbuss avatar Apr 10 '17 15:04 ianbuss