replicator
replicator copied to clipboard
replicator is a toolkit for ingesting logical replication feeds into a CockroachDB cluster
In scenarios where the target schema does not define tables of the same names used in an incoming CDC feed (e.g. if using a `dispatch` function in the userscript that...
It should be straightforward to have a standard performance benchmark command that will characterize the performance of a user's staging and target databases. This command should exercise the `stage` and...
We have apply_duration_seconds_bucket to track apply performance, however we don't track the size of mutation batches we are upserting/deleting.
While these steps are available in the Postgres documentation, it would be useful to explicitly mention them: - wal_level should be set to "logical" - During bulk migration, "SELECT pg_export_snapshot()"...
For simple use cases, we should support at least a simple way of copying the table data from the source db to the target cluster. If we use a `pg_export_snapshot()`,...
Discussion issue for thinking about how two clusters could be kept in eventual-sync: * Source/destination cluster tracking column to avoid looping cdc-sink'ed mutations * MVCC timestamp comparison to drop too-old...
We need a playbook that describes how to use a CRDB backup to initially populate the target database, and then how to create a changefeed that starts from the backup's...
We should support consuming a CRDB core changefeed, to support a "pull"-based model if the source cluster is not able to create outgoing network connections to a cdc-sink deployment. This...