bugbug
Evaluate switching to pickle DBs
They should be faster to read and write (we need to confirm).
Since we started using orjson, this might no longer be useful. I quickly ran a couple of basic benchmarks, and the orjson implementation is basically on par with the pickle one, but we need to benchmark more rigorously to confirm.
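As a starting point for a more careful comparison, here is a minimal sketch of a round-trip benchmark. It is not the benchmark mentioned above: the record shape, `make_records`, `time_roundtrip`, and `run_benchmark` are all hypothetical names made up for illustration, and it only measures in-memory serialization, not disk I/O or compression. orjson is included only if it is installed; otherwise the stdlib `json` module serves as the baseline.

```python
import json
import pickle
import time

try:
    import orjson  # third-party; optional in this sketch
except ImportError:
    orjson = None


def make_records(n):
    # Hypothetical records standing in for DB entries (assumed shape).
    return [
        {"id": i, "summary": f"bug {i}", "keywords": ["a", "b"], "score": i / 7}
        for i in range(n)
    ]


def time_roundtrip(dumps, loads, records, repeats=5):
    """Return (best elapsed seconds, decoded records) for a dumps/loads round trip."""
    best = float("inf")
    decoded = None
    for _ in range(repeats):
        start = time.perf_counter()
        decoded = loads(dumps(records))
        best = min(best, time.perf_counter() - start)
    return best, decoded


def run_benchmark(n=10_000):
    records = make_records(n)
    codecs = {
        "pickle": (
            lambda obj: pickle.dumps(obj, protocol=pickle.HIGHEST_PROTOCOL),
            pickle.loads,
        ),
        "json": (lambda obj: json.dumps(obj).encode(), json.loads),
    }
    if orjson is not None:
        codecs["orjson"] = (orjson.dumps, orjson.loads)

    results = {}
    for name, (dumps, loads) in codecs.items():
        elapsed, decoded = time_roundtrip(dumps, loads, records)
        # Sanity check: every codec must reproduce the input exactly.
        assert decoded == records
        results[name] = elapsed
    return results


if __name__ == "__main__":
    for name, elapsed in run_benchmark().items():
        print(f"{name}: {elapsed * 1000:.2f} ms")
```

Taking the best of several repeats reduces noise from other processes. A serious comparison would also need to run on real DB contents and include file writes, reads, and whatever compression is in use.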
Parquet is also designed for efficient reads and writes on large datasets.