timescaledb-parallel-copy
timescaledb-parallel-copy copied to clipboard
Performance tuning tips
This is rather an open question. I am looking for any experience/tips anyone has regarding performance tuning to increase import speed. We are trying to import several billion rows of data from InfluxDB to TimescaleDB.
Batch size? Copy options? PostgreSQL tuning parameters?
Here are a few good tips: https://www.timescale.com/blog/13-tips-to-improve-postgresql-insert-performance/
Also, there's some good ideas for large datasets here: https://github.com/timescale/timescaledb-extras#useful-utilities
I'd also encourage you to enable the --report and start testing with different batch sizes and parallel processes. It will depend on the machine you're using.