matrixone icon indicating copy to clipboard operation
matrixone copied to clipboard

[Bug]: Stability test tpcc reported error: ResultSet is from UPDATE. No Data

Open heni02 opened this issue 1 year ago • 20 comments

Is there an existing issue for the same bug?

  • [X] I have checked the existing issues.

Branch Name

main

Commit ID

d50211bca84238c88eae7028a67fe7fb14e859c1

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

稳定性测试是一种sysbench10万,tpcc 10仓10终端,tpch100G混合场景长时间的测试,该错误是在tpcc测试时出现的 企业微信截图_ec801b2e-7379-4afd-ab52-d4d971477b50

mo log: http://10.222.6.1/explore?panes=%7B%2222b%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-stability-regression%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221703063890000%22,%22to%22:%221703063890000%22%7D%7D%7D&schemaVersion=1&orgId=1

mem use: http://10.222.6.1/d/85a562078cdf77779eaa1add43ccec1e/kubernetes-compute-resources-namespace-pods?orgId=1&from=1703041729328&to=1703074206027&var-datasource=prometheus&var-cluster=&var-namespace=mo-stability-regression

Expected Behavior

No response

Steps to Reproduce

稳定性测试

Additional information

No response

heni02 avatar Dec 22 '23 11:12 heni02

看上去是 out of memory 导致的,应该先修复内存问题

reusee avatar Dec 25 '23 02:12 reusee

问题出现的时间点 utc 2023/12/20 09:18:10。

mpool 有oom。

image 同时在这个区间内 pod 有oom重启。

image

http://10.222.6.1/d/cluster-detail-namespaced/cluster-detail-namespaced?orgId=1&from=1703059729000&to=1703074206000&var-namespace=mo-stability-regression&var-account=All&var-interval=$__auto_interval_interval&var-cluster=.%2A&var-loki=loki

daviszhen avatar Dec 26 '23 06:12 daviszhen

同时还有些 pipeline panic问题,不确定是mpool引起的。

https://github.com/matrixorigin/matrixone/issues/13741

image

daviszhen avatar Dec 26 '23 07:12 daviszhen

http://10.222.6.1/explore?panes=%7B%2222b%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-stability-regression%5C%22%7D%20%7C%3D%20%60out%20of%20memory%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221703063890000%22,%22to%22:%221703064010000%22%7D%7D%7D&schemaVersion=1&orgId=1

daviszhen avatar Dec 26 '23 07:12 daviszhen

OOM问题,辛苦莫尘统一处理一下 @heni02 看是否合并成一个issue

ouyuanning avatar Dec 26 '23 07:12 ouyuanning

tpcc工具日志: benchmarkinfo.tar.gz

heni02 avatar Dec 27 '23 07:12 heni02

正在和存储的同事协商https://github.com/matrixorigin/matrixone/issues/12532

nnsgmsone avatar Jan 03 '24 10:01 nnsgmsone

内存问题等待#12532

nnsgmsone avatar Jan 08 '24 10:01 nnsgmsone

内存问题等待#12532

nnsgmsone avatar Jan 12 '24 10:01 nnsgmsone

内存问题等待#12532

nnsgmsone avatar Jan 17 '24 10:01 nnsgmsone

内存问题等待https://github.com/matrixorigin/matrixone/issues/12532

nnsgmsone avatar Jan 22 '24 10:01 nnsgmsone

内存问题等待https://github.com/matrixorigin/matrixone/issues/12532

nnsgmsone avatar Jan 25 '24 10:01 nnsgmsone

内存问题等待https://github.com/matrixorigin/matrixone/issues/12532

nnsgmsone avatar Jan 30 '24 10:01 nnsgmsone

内存问题等待https://github.com/matrixorigin/matrixone/issues/12532

nnsgmsone avatar Feb 02 '24 10:02 nnsgmsone

no process

nnsgmsone avatar Feb 21 '24 13:02 nnsgmsone

no process

nnsgmsone avatar Feb 26 '24 10:02 nnsgmsone

处理数据正确性问题中

nnsgmsone avatar Feb 29 '24 10:02 nnsgmsone

no process

nnsgmsone avatar Mar 05 '24 10:03 nnsgmsone

no process

nnsgmsone avatar Mar 08 '24 10:03 nnsgmsone

no process

nnsgmsone avatar Mar 13 '24 10:03 nnsgmsone

no process

nnsgmsone avatar Mar 18 '24 11:03 nnsgmsone

no process

nnsgmsone avatar Mar 21 '24 10:03 nnsgmsone

未在出现过,先关闭

aressu1985 avatar Apr 01 '24 01:04 aressu1985