matrixone icon indicating copy to clipboard operation
matrixone copied to clipboard

[Bug]: [date 5.9]tke regression: load data reported the table column is larger than input data column

Open heni02 opened this issue 1 year ago • 4 comments

Is there an existing issue for the same bug?

  • [X] I have checked the existing issues.

Branch Name

main

Commit ID

1aed8d948

Other Environment Information

- Hardware parameters:
- OS type:
- Others:

Actual Behavior

job:https://github.com/matrixorigin/mo-nightly-regression/actions/runs/9019459807/job/24803217771 企业微信截图_28dce51e-a38c-42cf-8d4e-5493a2d920fb

mo log: https://grafana.ci.matrixorigin.cn/explore?panes=%7B%22fzn%22:%7B%22datasource%22:%22loki%22,%22queries%22:%5B%7B%22refId%22:%22A%22,%22expr%22:%22%7Bnamespace%3D%5C%22mo-nightly-regression-20240509%5C%22%7D%20%7C%3D%20%60%60%22,%22queryType%22:%22range%22,%22datasource%22:%7B%22type%22:%22loki%22,%22uid%22:%22loki%22%7D,%22editorMode%22:%22builder%22%7D%5D,%22range%22:%7B%22from%22:%221715317956000%22,%22to%22:%221715317980000%22%7D%7D%7D&schemaVersion=1&orgId=1

Expected Behavior

No response

Steps to Reproduce

tke regression sysbench 1000w update-index test

Additional information

No response

heni02 avatar May 10 '24 06:05 heni02

错误日志: image

报错的代码: image

看着比较奇怪。这里的含义是。解析到的行中,有一个列的,本来应该是datetime类型的,但是现在它的值无法转成datetime类型。 而其实际内容是 一个 报错信息。

这个报错看着不像是load sysbench文件导入的信息。sysbench的信息应该不会有那么怪异的内容。看着倒像是log信息

ouyuanning avatar May 10 '24 09:05 ouyuanning

image

应该有2个问题 1、在load导入的时候,发生了一次报错"the table column is larger than input data column" image

2、这个报错信息被记录后,被日志继续导入到另外一个地方。这个时候的报错是无法解析为datetime类型 这个部分应该是对于输入的内容的切割不对

ouyuanning avatar May 10 '24 09:05 ouyuanning

等这个PR https://github.com/matrixorigin/matrixone/pull/16027 合并后再验证下

ouyuanning avatar May 15 '24 10:05 ouyuanning

https://github.com/matrixorigin/matrixone/pull/16027 这个PR能解决2的问题。

1的部分才是关键。但是拿其中一个sbtest2.csv(2G)在本地跑了几次没有复现。

ouyuanning avatar May 17 '24 11:05 ouyuanning

还没时间继续看

ouyuanning avatar May 22 '24 11:05 ouyuanning

同上

jensenojs avatar May 27 '24 10:05 jensenojs

同上

jensenojs avatar May 30 '24 10:05 jensenojs

同上

jensenojs avatar Jun 04 '24 10:06 jensenojs

近期没有复现。可能我之前的那个pr也有修复了。但是不太能确定。

ouyuanning avatar Jun 06 '24 06:06 ouyuanning

待逻辑检验是否修掉了

jensenojs avatar Jun 11 '24 10:06 jensenojs

image

看着当时的报错是这个sql

load data url s3option { "endpoint" = "http://cos.ap-guangzhou.myqcloud.com",
"access_key_id" = "******",
"secret_access_key" = "******",
"bucket" = "mo-load-guangzhou-1308875761",
"filepath" = "sysbench_1000w_auto_index/sbtest8.csv" } INTO TABLE sbtest8 FIELDS TERMINATED BY "," lines TERMINATED BY "\\n" parallel "true"

具体的报错error看着把报错信息本身当成了DataTime的信息去解析, 需要思考为什么会出现这个场景. image

jensenojs avatar Jun 13 '24 08:06 jensenojs

datetime类型的问题有可能已经解决掉了。是后台服务load inline的时候,转义的问题。

ouyuanning avatar Jun 14 '24 02:06 ouyuanning

这个pr会在1.2.1中的load 开启严格模式

  • https://github.com/matrixorigin/matrixone/pull/16907

jensenojs avatar Jun 14 '24 09:06 jensenojs

等待观察

jensenojs avatar Jun 19 '24 10:06 jensenojs

一直未能复现, 可能已经被修复了, pr16907在1.2.1中加了相关日志并且开启了严格模式, 在再次复现的时候能有更多信息做诊断.

jensenojs avatar Jun 24 '24 02:06 jensenojs

该问题单先关闭,待出夏侯,在重新提新问题跟踪

aressu1985 avatar Jul 02 '24 06:07 aressu1985