matrixone icon indicating copy to clipboard operation
matrixone copied to clipboard

[Bug]: mo_cdc: update mo table, downstream mysql row count first decrease and then increase

Open heni02 opened this issue 1 year ago • 2 comments

Is there an existing issue for the same bug?

  • [X] I have checked the existing issues.

Branch Name

main

Commit ID

dbced45

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

创建cdc任务,然后上游mo执行update长事务,下游观察mysql数据先变少,后变多 企业微信截图_c5e70670-9582-450e-a6b3-28e490bd43f7 过几分钟后开始变多 企业微信截图_e7c6c2e7-8783-44a3-ab44-dcb8a45c0323

暂时还不清楚暂停任务是否会有数据不一致的影响,后续暂停任务问题解决后补充测试结果

Expected Behavior

No response

Steps to Reproduce

CREATE TABLE ORDERS_15k(
O_ORDERKEY       BIGINT NOT NULL,
O_CUSTKEY        INTEGER NOT NULL,
O_ORDERSTATUS    CHAR(1) NOT NULL,
O_TOTALPRICE     DECIMAL(15,2) NOT NULL,
O_ORDERDATE      DATE NOT NULL,
O_ORDERPRIORITY  CHAR(15) NOT NULL,
O_CLERK          CHAR(15) NOT NULL,
O_SHIPPRIORITY   INTEGER NOT NULL,
O_COMMENT        VARCHAR(79) NOT NULL,
PRIMARY KEY (O_ORDERKEY)
);
load data url s3option {'endpoint'='http://minio.minio-mo.svc.cluster.local','access_key_id'='xxx','secret_access_key'='xxx','bucket'='mo-load-data', 'filepath'='tpch_10/orders.tbl','provider'='minio'} into table orders_15k fields terminated by '|' lines terminated by '\n' parallel 'true';
update ORDERS_15k set O_COMMENT='124' where O_ORDERKEY>3874334;
./mo_cdc task create --task-name "cdc_15k_new" --source-uri="mysql://dump:[email protected]:6001" --sink-type="mysql" --sink-uri="mysql://dump:[email protected]:3306"    --tables='test_db.orders_15k:test_cdc_db.ORDERS_15k' --level="account"  --account="sys"

Additional information

No response

heni02 avatar Oct 12 '24 10:10 heni02

回归验证下游还是存在先减少后增加的现象 commit:72b1061 企业微信截图_85234070-916b-45e6-ae9f-0039a165d185 企业微信截图_d0d46d2b-e72b-4c23-899c-4ba4c1549c27 企业微信截图_0faeeed8-7bb5-4fba-aaf2-dc0b929c18db 同步后下游数据是一致的 企业微信截图_f6679ac1-8965-47ff-9428-9a2b64c97490

mo log: https://shanghai.idc.matrixorigin.cn:30001/explore?panes=%7B%22t-5%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-cdc-test%5C%22%7D%20%7C%3D%20%60tableReader%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221729665208959%22,%22to%22:%221729668808959%22%7D%7D%7D&schemaVersion=1&orgId=1

heni02 avatar Oct 23 '24 07:10 heni02

找到问题,修改中

ck89119 avatar Oct 23 '24 08:10 ck89119

待验证

ck89119 avatar Oct 24 '24 13:10 ck89119

已经解决

daviszhen avatar Oct 25 '24 08:10 daviszhen

update同步验证没有先减少后增加的现象了,但是同步数据需要大改12分钟(1500w 1.7G),感觉有点慢 企业微信截图_5891bcf2-0a75-4ab9-8d78-b16bc3f6a728 企业微信截图_8061641c-4597-458c-9f41-6b3eb7ef9d27

heni02 avatar Oct 25 '24 16:10 heni02

性能问题继续分析,该问题先closed

heni02 avatar Oct 25 '24 16:10 heni02