Replibyte icon indicating copy to clipboard operation
Replibyte copied to clipboard

Looking for a PostgreSQL and MongoDB dump between 1TB and 30TB of data?

Open evoxmusic opened this issue 2 years ago • 5 comments

Hi, I am looking for a PostgreSQL and MongoDB data dump between 1TB and 30TB. The idea would be to build a pipeline test of RepliByte with a very large dump file. It will help a lot to optimize RepliByte. Anyone?

evoxmusic avatar Mar 30 '22 21:03 evoxmusic

I don't know if this suggestion will help you, but an alternate solution would be to create your own dump, if no one can provide one. A simple script would be able to prepare the db with the amount of data you want.

Vagelis-Prokopiou avatar Apr 01 '22 09:04 Vagelis-Prokopiou

It's a good idea and easier today I guess. Happy to have your contrib if it's doable for you :)

evoxmusic avatar Apr 01 '22 10:04 evoxmusic

Would be happy to help but creating such a dump is not feasible for me due to insufficient hard drive space :-)

Vagelis-Prokopiou avatar Apr 01 '22 10:04 Vagelis-Prokopiou

For PostgreSQL, you could generate the data with pgbench, something along the lines of:

psql -c 'CREATE DATABASE pg_test'
pgbench -i -s 75000 pg_test

The scaling factor is based on an older wiki entry: https://wiki.postgresql.org/wiki/Pgbenchtesting

markrechler avatar May 16 '22 14:05 markrechler

Thank you, super useful

evoxmusic avatar May 16 '22 19:05 evoxmusic