matrixone
[Bug]: [date 3.23] tke regression: sysbench test does not exit after reporting a Duplicate entry error and hangs indefinitely
Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
Branch Name
main
Commit ID
2674995df
Other Environment Information
- Hardware parameters:
- OS type:
- Others:
Actual Behavior
job:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/8403426531/job/23017476777
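The Duplicate entry error has occurred before, and in those cases the test exited immediately and did not continue running. In the last few days, however, the test has failed to exit after reporting this error and hangs indefinitely. I suspect that many sessions are not being closed; this needs investigation.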
log: https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22WhO%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240323%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221711238284000%22,%22to%22:%221711238338000%22%7D%7D%7D&schemaVersion=1&orgId=1
Expected Behavior
No response
Steps to Reproduce
sysbench 10w (100,000) read_write test
Additional information
No response
@ck89119 please take a look.
date 5.3 tke regression
https://github.com/matrixorigin/mo-nightly-regression/actions/runs/8940629660/job/24569679599
mo log:
https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22Qq4%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240503%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221714768123000%22,%22to%22:%221714771808000%22%7D%7D%7D&schemaVersion=1&orgId=1
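This occurred during the May Day holiday. It does not reproduce reliably, and the profile was cleaned up automatically, so we will investigate the next time it reproduces.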
Reproduced in the date 5.16 regression: after the dup error was reported, mo hung.
job:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9114235510/job/25069419376
mo log: https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22C5l%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240516%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221715890350000%22,%22to%22:%221715892044000%22%7D%7D%7D&schemaVersion=1&orgId=1
profile: etl_profile.zip
https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22C5l%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240516%5C%22%7D%20%7C%3D%20%60found%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221715890350000%22,%22to%22:%221715893184000%22%7D%7D%7D&schemaVersion=1&orgId=1
These are the long-running ones.
For the leak, FOOTPRINT will be added today.
https://grafana.ci.matrixorigin.cn/goto/z4ewk7PIR?orgId=1
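@daviszhen The link above shows the threads that are stuck waiting for locks. In every case, the transaction being waited on is the leaked transaction, so the hang is caused by the leak.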
txn leak:018f8309b98a701784181859b15fb371
20:13:27 sql failed, but txn is still active
@daviszhen please help to resolve this issue
https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22C5l%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240516%5C%22%7D%20%7C%3D%20%60018f8309b98a701784181859b15fb371%60%20%21%3D%20%60wait%20too%20long%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221715890350000%22,%22to%22:%221715893184000%22%7D%7D%7D&schemaVersion=1&orgId=1
The duplicate problem may be the same issue as #15843. I am not sure about the hang problem.
This requires a change to the test script.
Not yet worked on.