pubsub-backup
pubsub-backup copied to clipboard
Streaming insert limit
Streaming insert limits are documented here: https://cloud.google.com/bigquery/quotas#streaming_inserts
This project uses fixed windows of a minute: https://github.com/alexvanboxel/pubsub-backup/blob/7dd0f127ddbfd65ab3d48133651ce090ed336b73/src/main/java/backup/BackupPipeline.java#L59
If you end up going over the limit for streaming inserts, this might cause some issues with your pipeline.
What do you think about using something like this?
Composite trigger that fires whenever the pane has at least 100 elements, or after a minute.
Repeatedly.forever(AfterFirst.of(
AfterPane.elementCountAtLeast(100),
AfterProcessingTime.pastFirstElementInPane().plusDelayOf(Duration.standardMinutes(1))))
where the number 100 can be configured/calculated based on the size of the row to not go over the streaming insert limits.