flink-cdc
flink-cdc copied to clipboard
Flink CDC is a streaming data integration tool
This is the way to improve data processing speed during the phase from snapshot to incremental phase. https://issues.apache.org/jira/browse/FLINK-36798 And the complexity of `log(n)` refers to this issue:https://github.com/apache/flink-cdc/pull/3630.
At present, FlinkCDC only supports reading Flink configurations from Flink con files, but this approach is not user-friendly for multiple jobs that require different configurations. Allow passing Flink configuration by...
PR from https://github.com/apache/flink-cdc/pull/2571 Ignore unnecessary fetcher shutdowns To hide the element queue from the connector developers and make SplitFetcherManager the only owner class of the queue It's enough to refresh...
In Multi Transform, verify that column counts, metadata fields like primaryKeys, partitionKeys, and options are consistent
**Questions**: 1. Should we synchronize this request with schema changes? I.e. respond with "success" only if the coordinator is idle; otherwise, respond with "busy" and have the operator retry.
Use Incremental Source Framework in Flink CDC TiKV Source Connector
The commons-codec dependency should not be declared in parent pom with test scope because connector module may depend on it as a transitive dependency, and it will not be packaged...
**Background** In some scenarios, MySQL synchronization only expects to synchronize specified fields instead of all fields in the table. 1. The user only has the permission for some fields in...
Impove mysql schema performance pr https://github.com/apache/flink-cdc/pull/3981
https://issues.apache.org/jira/browse/FLINK-36631 OracleCDC currently only supports initial and latest-offset. Add specific-offset to support reading incremental data from a specified SCN.