About KIE dataset annotation

Open LJY6356 opened this issue 2 years ago • 3 comments

How can I use PPOCRLabelv2 to generate the officially specified dataset format shown below?

zh_train_0.jpg [{"transcription": "汇丰晋信", "label": "other", "points": [[104, 114], [530, 114], [530, 175], [104, 175]], "id": 1, "linking": []}, {"transcription": "受理时间:", "label": "question", "points": [[126, 267], [266, 267], [266, 305], [126, 305]], "id": 7, "linking": [[7, 13]]}, {"transcription": "2020.6.15", "label": "answer", "points": [[321, 239], [537, 239], [537, 285], [321, 285]], "id": 13, "linking": [[7, 13]]}]
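
For illustration only, here is a minimal Python sketch of how a line in this format could be parsed and sanity-checked. It assumes the image name and the JSON list are separated by whitespace, as in the sample above (real label files may use a tab), and that every entry should carry the five keys shown. The helper names `parse_kie_line` and `check_entries` are hypothetical and are not part of PaddleOCR or PPOCRLabel.

```python
import json

# Keys that appear on every entry in the sample line (an assumption drawn from that sample).
REQUIRED_KEYS = {"transcription", "label", "points", "id", "linking"}


def parse_kie_line(line):
    """Split one annotation line into (image_name, list_of_entries)."""
    # Assumes the image name contains no whitespace.
    image_name, payload = line.strip().split(None, 1)
    return image_name, json.loads(payload)


def check_entries(entries):
    """Return a list of problems; an empty list means the entries look consistent."""
    problems = []
    known_ids = {entry.get("id") for entry in entries}
    for entry in entries:
        missing = REQUIRED_KEYS - entry.keys()
        if missing:
            problems.append(f"entry {entry.get('id')}: missing {sorted(missing)}")
        # Each link is expected to be a pair of ids that both exist in this image.
        for pair in entry.get("linking", []):
            if len(pair) != 2 or not set(pair) <= known_ids:
                problems.append(f"entry {entry.get('id')}: bad link {pair}")
    return problems


sample = (
    'zh_train_0.jpg [{"transcription": "受理时间:", "label": "question", '
    '"points": [[126, 267], [266, 267], [266, 305], [126, 305]], '
    '"id": 7, "linking": [[7, 13]]}, '
    '{"transcription": "2020.6.15", "label": "answer", '
    '"points": [[321, 239], [537, 239], [537, 285], [321, 285]], '
    '"id": 13, "linking": [[7, 13]]}]'
)

name, entries = parse_kie_line(sample)
print(name, len(entries), check_entries(entries))
```

An entry exported without "id" or "linking" would show up in the returned problem list, which makes it easy to spot files that still need those fields.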

LJY6356, Nov 09 '22 07:11

Follow the documentation; once you finish annotating, the exported file is already in this format.

MissPenguin, Nov 14 '22 11:11

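> Follow the documentation; once you finish annotating, the exported file is already in this format.

The exported file has no "linking" field.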

LJY6356, Nov 15 '22 01:11

Same question: when using PPOCRLabelv2 for RE (relation extraction), how do you annotate each field's id and linking information?

smallhaozigithub, Nov 15 '22 08:11

Annotation for the RE (relation extraction) task is not supported yet.

MissPenguin, Nov 30 '22 07:11