datafusion-ballista icon indicating copy to clipboard operation
datafusion-ballista copied to clipboard

Enable benchmark data validation for distributed execution

Open andygrove opened this issue 4 years ago • 1 comments

Is your feature request related to a problem or challenge? Please describe what you are trying to do. The TPC-H benchmark suite already has a feature for verifying that results are correct when executing in-memory with DataFusion. It would be good to extend this support to distributed execution with Ballista.

Describe the solution you'd like I would like an option to run the benchmark in data validation mode when executing against a Ballista cluster.

Describe alternatives you've considered None

Additional context None

andygrove avatar Apr 25 '21 13:04 andygrove

I can take this up ✌️

msathis avatar Apr 26 '21 08:04 msathis