data-diff
data-diff copied to clipboard
CSV insertions for database for better benchmarking
Currently if you run the benchmarking scripts (see README and https://github.com/datafold/data-diff/pull/135) it's very slow against the cloud databases. It would be better to use CSV imports for the cloud databases (redshift, bigquery, oracle, snowflake) by doing something similar to dev/_bq_import_csv.py in _insert_to_table.
It will work today for 100M rows, but it'll be very slow...