tipoca-stream

DB archival can cause slowness in full reload over time

Open justjkk opened this issue 5 years ago • 2 comments

Multiple tables, such as communicator.genericsms, archive data older than a few months to S3 to save space. Over time, this adds more entries to Kafka, which slows down a full reload. We need a way to periodically refresh the Debezium data so that it is stored as snapshots rather than as events.

justjkk avatar Mar 05 '21 10:03 justjkk

This issue can also be caused by tables that receive a lot of updates, such as the last_accessed_at column of the sessions table.

justjkk avatar Mar 05 '21 10:03 justjkk

Log compaction can be used here. It can help keep only the latest record for each key. @justjkk
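
To make the idea concrete, here is a minimal Python sketch of what compaction does to a topic. It is only a simulation, not Kafka code. The keys and values are hypothetical, and it assumes deletes arrive as tombstones (records with a null value). On a real topic, compaction is enabled with the `cleanup.policy=compact` topic config and runs in the background, so older records may still be present for a while.

```python
from typing import Dict, Iterable, List, Optional, Tuple

# Each event is (key, value). A value of None models a tombstone (delete marker).
# Keys and payloads below are hypothetical, for illustration only.
Event = Tuple[str, Optional[dict]]


def compact(events: Iterable[Event]) -> List[Event]:
    """Keep only the latest event per key, as a fully compacted log would."""
    latest: Dict[str, Optional[dict]] = {}
    for key, value in events:
        # Remove and re-insert so the key takes the position of its newest event.
        latest.pop(key, None)
        latest[key] = value
    # Drop keys whose latest event is a tombstone (the row was deleted or archived).
    return [(k, v) for k, v in latest.items() if v is not None]


events: List[Event] = [
    ("session:1", {"last_accessed_at": "10:00"}),
    ("session:2", {"last_accessed_at": "10:01"}),
    ("session:1", {"last_accessed_at": "10:05"}),  # repeated update, same key
    ("sms:7", {"body": "hello"}),
    ("sms:7", None),                               # row archived, then deleted
    ("session:1", {"last_accessed_at": "10:09"}),
]

compacted = compact(events)
print(len(events), "events before,", len(compacted), "after")
```

With this input, six events reduce to two: the frequently updated session key keeps only its last value, and the archived row disappears entirely. A full reload would then read roughly one record per live row instead of the whole change history.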

alok87 avatar Jul 29 '21 17:07 alok87