matrixone icon indicating copy to clipboard operation
matrixone copied to clipboard

[Bug]: [date 3.12]standalone regression: tpcc 100warehouse 1000threads reported many connection timed out

Open heni02 opened this issue 1 year ago • 3 comments

Is there an existing issue for the same bug?

  • [X] I have checked the existing issues.

Branch Name

main

Commit ID

7164ded60b563e2d5f3cc4cd411bd892816e2a93

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

job:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/8250543191/job/22565428905 企业微信截图_bf75fa59-c7e3-486b-bd7e-74c46edf567d 企业微信截图_f078f11d-1412-4a40-a989-5507f72d0f60

log: 报超时时间是2024-03-13 07:28:10,2024/03/13 07:30:12报panic错误,期间日志里有大量的read tcp4 127.0.0.1:36974->127.0.0.1:32001: i/o timeout"错误 企业微信截图_b5c2aa1c-a5fc-4d25-b5b3-e2d0c7cd16d5 企业微信截图_fc481f21-929b-4e21-a686-e8b0a5ebfbdf whole logs plz contact to me

Expected Behavior

No response

Steps to Reproduce

tpcc 100 warehouse 1000threads test

Additional information

No response

heni02 avatar Mar 13 '24 03:03 heni02

看着是前端空指针,麻烦看一下

nnsgmsone avatar Mar 13 '24 03:03 nnsgmsone

@ck89119 help to find something.

daviszhen avatar Mar 13 '24 04:03 daviszhen

并发场景可能会出现同时deleteRoutine的情况,需要先判断rt是否为nil

ck89119 avatar Mar 14 '24 06:03 ck89119

panic问题已修复,出现timeout的原因没看出来,不确定是不是panic影响导致的,先继续观察,再次出现timeout的时候再分析。

ck89119 avatar Mar 20 '24 06:03 ck89119

@ck89119 现在回归没有该问题了,超时问题加一些日志?下次复现可以直接定位 https://github.com/matrixorigin/mo-nightly-regression/actions/runs/8347491353 企业微信截图_d39087f2-c004-44cb-afcd-886896e5d3c1

heni02 avatar Mar 20 '24 06:03 heni02

先关闭

heni02 avatar Mar 20 '24 06:03 heni02