cudf
cudf copied to clipboard
Reducing runtime of JSON reader options benchmark
Description
This PR cleans up the JSON reader options benchmark by reducing the number of runtime configurations from 162 to 20. Reasoning behind the splitting of the benchmark -
- The
normalize_single_quotesandnormalize_whitespaceare pre-processing options and do not impact each other - the runtimes of the FSTs are additive. - The performance of raw input ingestion (
row_selection::ALLandrow_selection::BYTE_RANGE) is independent of the token generation and tree algorithms.
Checklist
- [X] I am familiar with the Contributing Guidelines.
- [ ] New or existing tests cover these changes.
- [ ] The documentation is up to date with these changes.