[Bug]: mo_cdc: after updating an MO table, downstream MySQL row count first decreases and then increases
Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
Branch Name
main
Commit ID
dbced45
Other Environment Information
- Hardware parameters:
- OS type:
- Others:
Actual Behavior
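Create a CDC task, then run a long-running UPDATE transaction on the upstream MO. On the downstream MySQL, the row count is observed to first decrease and then increase.
The count starts increasing again after a few minutes.
It is not yet clear whether pausing the task causes data inconsistency; test results will be added once the task-pause issue is resolved.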
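One way to make the symptom above measurable is to sample the downstream row count at intervals and check the series for a dip. The sketch below is only an illustration. It assumes the counts come from repeatedly running something like `SELECT COUNT(*) FROM test_cdc_db.ORDERS_15k` on the downstream MySQL. The function name `find_dip` and the sample numbers are hypothetical, not measurements from this issue.

```python
from typing import Optional, Sequence, Tuple


def find_dip(counts: Sequence[int]) -> Optional[Tuple[int, int]]:
    """Return (index, value) of the lowest sample if the series drops below
    its first sample and later rises again; otherwise return None."""
    if len(counts) < 3:
        return None
    baseline = counts[0]
    # Index of the first occurrence of the minimum value.
    low_idx = min(range(len(counts)), key=lambda i: counts[i])
    low = counts[low_idx]
    dipped = low < baseline
    recovered = any(c > low for c in counts[low_idx + 1:])
    return (low_idx, low) if dipped and recovered else None


# Hypothetical samples, not data from this issue.
samples = [1000, 1000, 640, 310, 520, 1000]
print(find_dip(samples))  # prints (3, 310)
```

A series that never drops below its first sample, or that drops and never recovers, returns `None`, so the helper only flags the decrease-then-increase pattern described here.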
Expected Behavior
No response
Steps to Reproduce
CREATE TABLE ORDERS_15k(
O_ORDERKEY BIGINT NOT NULL,
O_CUSTKEY INTEGER NOT NULL,
O_ORDERSTATUS CHAR(1) NOT NULL,
O_TOTALPRICE DECIMAL(15,2) NOT NULL,
O_ORDERDATE DATE NOT NULL,
O_ORDERPRIORITY CHAR(15) NOT NULL,
O_CLERK CHAR(15) NOT NULL,
O_SHIPPRIORITY INTEGER NOT NULL,
O_COMMENT VARCHAR(79) NOT NULL,
PRIMARY KEY (O_ORDERKEY)
);
load data url s3option {'endpoint'='http://minio.minio-mo.svc.cluster.local','access_key_id'='xxx','secret_access_key'='xxx','bucket'='mo-load-data', 'filepath'='tpch_10/orders.tbl','provider'='minio'} into table orders_15k fields terminated by '|' lines terminated by '\n' parallel 'true';
update ORDERS_15k set O_COMMENT='124' where O_ORDERKEY>3874334;
./mo_cdc task create --task-name "cdc_15k_new" --source-uri="mysql://dump:[email protected]:6001" --sink-type="mysql" --sink-uri="mysql://dump:[email protected]:3306" --tables='test_db.orders_15k:test_cdc_db.ORDERS_15k' --level="account" --account="sys"
Additional information
No response
Regression verification: the downstream still shows the decrease-then-increase behavior.
commit:72b1061
After synchronization completes, the downstream data is consistent.
mo log: https://shanghai.idc.matrixorigin.cn:30001/explore?panes=%7B%22t-5%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-cdc-test%5C%22%7D%20%7C%3D%20%60tableReader%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221729665208959%22,%22to%22:%221729668808959%22%7D%7D%7D&schemaVersion=1&orgId=1
Found the problem; fix in progress.
Pending verification.
Resolved.
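Verified UPDATE synchronization: the decrease-then-increase behavior no longer occurs, but synchronizing the data takes about 12 minutes (15 million rows, 1.7 GB), which feels somewhat slow.
Performance analysis will continue separately; closing this issue for now.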
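For reference when following up on performance, the reported figures can be turned into a rough throughput number. This is a back-of-the-envelope sketch, assuming "1500w" means 15 million rows, "1.7G" means 1.7 GiB, and the duration is exactly 12 minutes (the report only says roughly 12). The function name `throughput` is hypothetical.

```python
from typing import Tuple


def throughput(rows: int, size_gib: float, minutes: float) -> Tuple[float, float]:
    """Return (rows per second, MiB per second) for a sync of the given size."""
    seconds = minutes * 60
    return rows / seconds, size_gib * 1024 / seconds


# Figures from the report, under the assumptions stated above.
rows_per_s, mib_per_s = throughput(15_000_000, 1.7, 12)
print(f"{rows_per_s:.0f} rows/s, {mib_per_s:.2f} MiB/s")  # prints 20833 rows/s, 2.42 MiB/s
```

Under these assumptions the sync runs at roughly 21 thousand rows per second, which gives a baseline to compare against once the performance work lands.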