
Error when running inference with an RE model trained on a local dataset: KeyError: 'relations'

Open justgoonz opened this issue 4 months ago • 0 comments

🔎 Search before asking

  • [X] I have searched the PaddleOCR Docs and found no similar bug report.
  • [X] I have searched the PaddleOCR Issues and found no similar bug report.
  • [X] I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (Problem description)

The command I ran is:

python3 ./tools/infer_kie_token_ser_re.py \
  -c  ./configs/kie/vi_layoutxlm/RE_front.yml\
  -o Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy\
  Global.infer_img=./train_data/OCR_front/test/image\
  -c_ser ./configs/kie/vi_layoutxlm/RE_front.yml \
  -o_ser Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy

It then failed with this error:

Traceback (most recent call last):
  File "./tools/infer_kie_token_ser_re.py", line 216, in <module>
    result = ser_re_engine(data)
  File "./tools/infer_kie_token_ser_re.py", line 147, in __call__
    ser_results, ser_inputs = self.ser_engine(data)
  File "/home/user/1_xsz_ocr/PaddleOCR/tools/infer_kie_token_ser.py", line 115, in __call__
    batch = transform(data, self.ops)
  File "/home/user/1_xsz_ocr/PaddleOCR/ppocr/data/imaug/__init__.py", line 72, in transform
    data = op(data)
  File "/home/user/1_xsz_ocr/PaddleOCR/ppocr/data/imaug/vqa/token/vqa_token_relation.py", line 31, in __call__
    relations = data["relations"]
KeyError: 'relations'

Looking at the source, I found that during inference the code that builds `data` is:

for idx, info in enumerate(infer_imgs):
    if config["Global"].get("infer_mode", None) is False:
        data_line = info.decode("utf-8")
        substr = data_line.strip("\n").split("\t")
        img_path = os.path.join(data_dir, substr[0])
        data = {"img_path": img_path, "label": substr[1]}
        print(substr)
    else:
        img_path = info
        data = {"img_path": img_path}

So `data` only ever contains the `img_path` and `label` fields, and never a `relations` field. What is going wrong here?
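To make the failure mode concrete, here is a minimal sketch of the call pattern shown in the traceback: a list of ops applied in order, where one op reads `data["relations"]`. The classes `DecodeImageStub` and `RelationOpStub` are hypothetical stand-ins, not PaddleOCR's actual implementations, and the assumption is only that the input dict is built the way the snippet above builds it.

```python
# Hypothetical stand-ins that mimic the call pattern in the traceback;
# these are NOT PaddleOCR's real classes.

class DecodeImageStub:
    """Pretends to load the image; only needs `img_path`."""

    def __call__(self, data):
        data["image"] = f"<pixels of {data['img_path']}>"
        return data


class RelationOpStub:
    """Reads ground-truth relations, like the line shown in the traceback."""

    def __call__(self, data):
        relations = data["relations"]  # KeyError if the key was never set
        data["relations"] = sorted(relations)
        return data


def transform(data, ops):
    """Apply each op in order, the same loop shape as `data = op(data)`."""
    for op in ops:
        data = op(data)
    return data


ops = [DecodeImageStub(), RelationOpStub()]

# Inference-style input: only the image path, no annotations.
error = None
try:
    transform({"img_path": "example.jpg"}, ops)
except KeyError as exc:
    error = exc
print("inference-style input raised:", repr(error))

# Input that already carries relations (assumed here to come from label parsing).
labeled = {"img_path": "example.jpg", "relations": [(1, 2), (0, 1)]}
result = transform(labeled, ops)
print("labeled input relations:", result["relations"])
```

The sketch only shows that any op requiring `relations` fails as soon as it sits in a pipeline that is fed image-only input; it does not say which config or engine should own that op.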

The full output is as follows:

(ocr_pp) user@gpu2:~/1_xsz_ocr/PaddleOCR$ python3 ./tools/infer_kie_token_ser_re.py   -c  ./configs/kie/vi_layoutxlm/RE_front.yml  -o Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy  Global.infer_img=./train_data/OCR_front/test/image  -c_ser ./configs/kie/vi_layoutxlm/RE_front.yml   -o_ser Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy
[2024/10/14 19:20:16] ppocr INFO: ********** re config **********
[2024/10/14 19:20:16] ppocr INFO: Architecture : 
[2024/10/14 19:20:16] ppocr INFO:     Backbone : 
[2024/10/14 19:20:16] ppocr INFO:         checkpoints : ./output/10_14/RE_front/best_accuracy
[2024/10/14 19:20:16] ppocr INFO:         mode : vi
[2024/10/14 19:20:16] ppocr INFO:         name : LayoutXLMForRe
[2024/10/14 19:20:16] ppocr INFO:         pretrained : True
[2024/10/14 19:20:16] ppocr INFO:     Transform : None
[2024/10/14 19:20:16] ppocr INFO:     algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:     model_type : kie
[2024/10/14 19:20:16] ppocr INFO: Eval : 
[2024/10/14 19:20:16] ppocr INFO:     dataset : 
[2024/10/14 19:20:16] ppocr INFO:         data_dir : /home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/test/image
[2024/10/14 19:20:16] ppocr INFO:         label_file_list : ['/home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/test/test.json']
[2024/10/14 19:20:16] ppocr INFO:         name : SimpleDataSet
[2024/10/14 19:20:16] ppocr INFO:         transforms : 
[2024/10/14 19:20:16] ppocr INFO:             DecodeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 channel_first : False
[2024/10/14 19:20:16] ppocr INFO:                 img_mode : RGB
[2024/10/14 19:20:16] ppocr INFO:             VQATokenLabelEncode : 
[2024/10/14 19:20:16] ppocr INFO:                 algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:                 class_path : train_data/OCR_front/class_list_xfun.txt
[2024/10/14 19:20:16] ppocr INFO:                 contains_re : True
[2024/10/14 19:20:16] ppocr INFO:                 order_method : tb-yx
[2024/10/14 19:20:16] ppocr INFO:                 use_textline_bbox_info : True
[2024/10/14 19:20:16] ppocr INFO:             VQATokenPad : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:                 return_attention_mask : True
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenRelation : None
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenChunk : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:             TensorizeEntitiesRelations : None
[2024/10/14 19:20:16] ppocr INFO:             Resize : 
[2024/10/14 19:20:16] ppocr INFO:                 size : [512, 512]
[2024/10/14 19:20:16] ppocr INFO:             NormalizeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 mean : [128, 128, 128]
[2024/10/14 19:20:16] ppocr INFO:                 order : hwc
[2024/10/14 19:20:16] ppocr INFO:                 scale : 1
[2024/10/14 19:20:16] ppocr INFO:                 std : [60, 60, 60]
[2024/10/14 19:20:16] ppocr INFO:             ToCHWImage : None
[2024/10/14 19:20:16] ppocr INFO:             KeepKeys : 
[2024/10/14 19:20:16] ppocr INFO:                 keep_keys : ['input_ids', 'bbox', 'attention_mask', 'token_type_ids', 'entities', 'relations']
[2024/10/14 19:20:16] ppocr INFO:     loader : 
[2024/10/14 19:20:16] ppocr INFO:         batch_size_per_card : 8
[2024/10/14 19:20:16] ppocr INFO:         drop_last : False
[2024/10/14 19:20:16] ppocr INFO:         num_workers : 8
[2024/10/14 19:20:16] ppocr INFO:         shuffle : False
[2024/10/14 19:20:16] ppocr INFO: Global : 
[2024/10/14 19:20:16] ppocr INFO:     cal_metric_during_train : False
[2024/10/14 19:20:16] ppocr INFO:     epoch_num : 130
[2024/10/14 19:20:16] ppocr INFO:     eval_batch_step : [0, 19]
[2024/10/14 19:20:16] ppocr INFO:     infer_img : ./train_data/OCR_front/test/image
[2024/10/14 19:20:16] ppocr INFO:     kie_det_model_dir : None
[2024/10/14 19:20:16] ppocr INFO:     kie_rec_model_dir : None
[2024/10/14 19:20:16] ppocr INFO:     log_smooth_window : 10
[2024/10/14 19:20:16] ppocr INFO:     print_batch_step : 10
[2024/10/14 19:20:16] ppocr INFO:     save_epoch_step : 2000
[2024/10/14 19:20:16] ppocr INFO:     save_inference_dir : None
[2024/10/14 19:20:16] ppocr INFO:     save_model_dir : ./output/10_14/RE_front
[2024/10/14 19:20:16] ppocr INFO:     save_res_path : ./output/10_14/re/OCR/with_gt
[2024/10/14 19:20:16] ppocr INFO:     seed : 2022
[2024/10/14 19:20:16] ppocr INFO:     use_gpu : True
[2024/10/14 19:20:16] ppocr INFO:     use_visualdl : False
[2024/10/14 19:20:16] ppocr INFO: Loss : 
[2024/10/14 19:20:16] ppocr INFO:     key : loss
[2024/10/14 19:20:16] ppocr INFO:     name : LossFromOutput
[2024/10/14 19:20:16] ppocr INFO:     reduction : mean
[2024/10/14 19:20:16] ppocr INFO: Metric : 
[2024/10/14 19:20:16] ppocr INFO:     main_indicator : hmean
[2024/10/14 19:20:16] ppocr INFO:     name : VQAReTokenMetric
[2024/10/14 19:20:16] ppocr INFO: Optimizer : 
[2024/10/14 19:20:16] ppocr INFO:     beta1 : 0.9
[2024/10/14 19:20:16] ppocr INFO:     beta2 : 0.999
[2024/10/14 19:20:16] ppocr INFO:     clip_norm : 10
[2024/10/14 19:20:16] ppocr INFO:     lr : 
[2024/10/14 19:20:16] ppocr INFO:         learning_rate : 5e-05
[2024/10/14 19:20:16] ppocr INFO:         warmup_epoch : 10
[2024/10/14 19:20:16] ppocr INFO:     name : AdamW
[2024/10/14 19:20:16] ppocr INFO:     regularizer : 
[2024/10/14 19:20:16] ppocr INFO:         factor : 0.0
[2024/10/14 19:20:16] ppocr INFO:         name : L2
[2024/10/14 19:20:16] ppocr INFO: PostProcess : 
[2024/10/14 19:20:16] ppocr INFO:     name : VQAReTokenLayoutLMPostProcess
[2024/10/14 19:20:16] ppocr INFO: Train : 
[2024/10/14 19:20:16] ppocr INFO:     dataset : 
[2024/10/14 19:20:16] ppocr INFO:         data_dir : /home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/train/image
[2024/10/14 19:20:16] ppocr INFO:         label_file_list : ['/home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/train/train.json']
[2024/10/14 19:20:16] ppocr INFO:         name : SimpleDataSet
[2024/10/14 19:20:16] ppocr INFO:         ratio_list : [1.0]
[2024/10/14 19:20:16] ppocr INFO:         transforms : 
[2024/10/14 19:20:16] ppocr INFO:             DecodeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 channel_first : False
[2024/10/14 19:20:16] ppocr INFO:                 img_mode : RGB
[2024/10/14 19:20:16] ppocr INFO:             VQATokenLabelEncode : 
[2024/10/14 19:20:16] ppocr INFO:                 algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:                 class_path : train_data/OCR_front/class_list_xfun.txt
[2024/10/14 19:20:16] ppocr INFO:                 contains_re : True
[2024/10/14 19:20:16] ppocr INFO:                 order_method : tb-yx
[2024/10/14 19:20:16] ppocr INFO:                 use_textline_bbox_info : True
[2024/10/14 19:20:16] ppocr INFO:             VQATokenPad : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:                 return_attention_mask : True
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenRelation : None
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenChunk : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:             TensorizeEntitiesRelations : None
[2024/10/14 19:20:16] ppocr INFO:             Resize : 
[2024/10/14 19:20:16] ppocr INFO:                 size : [512, 512]
[2024/10/14 19:20:16] ppocr INFO:             NormalizeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 mean : [128, 128, 128]
[2024/10/14 19:20:16] ppocr INFO:                 order : hwc
[2024/10/14 19:20:16] ppocr INFO:                 scale : 1
[2024/10/14 19:20:16] ppocr INFO:                 std : [60, 60, 60]
[2024/10/14 19:20:16] ppocr INFO:             ToCHWImage : None
[2024/10/14 19:20:16] ppocr INFO:             KeepKeys : 
[2024/10/14 19:20:16] ppocr INFO:                 keep_keys : ['input_ids', 'bbox', 'attention_mask', 'token_type_ids', 'entities', 'relations']
[2024/10/14 19:20:16] ppocr INFO:     loader : 
[2024/10/14 19:20:16] ppocr INFO:         batch_size_per_card : 2
[2024/10/14 19:20:16] ppocr INFO:         drop_last : False
[2024/10/14 19:20:16] ppocr INFO:         num_workers : 4
[2024/10/14 19:20:16] ppocr INFO:         shuffle : True
[2024/10/14 19:20:16] ppocr INFO: 

[2024/10/14 19:20:16] ppocr INFO: ********** ser config **********
[2024/10/14 19:20:16] ppocr INFO: Architecture : 
[2024/10/14 19:20:16] ppocr INFO:     Backbone : 
[2024/10/14 19:20:16] ppocr INFO:         checkpoints : ./output/10_14/RE_front/best_accuracy
[2024/10/14 19:20:16] ppocr INFO:         mode : vi
[2024/10/14 19:20:16] ppocr INFO:         name : LayoutXLMForRe
[2024/10/14 19:20:16] ppocr INFO:         pretrained : True
[2024/10/14 19:20:16] ppocr INFO:     Transform : None
[2024/10/14 19:20:16] ppocr INFO:     algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:     model_type : kie
[2024/10/14 19:20:16] ppocr INFO: Eval : 
[2024/10/14 19:20:16] ppocr INFO:     dataset : 
[2024/10/14 19:20:16] ppocr INFO:         data_dir : /home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/test/image
[2024/10/14 19:20:16] ppocr INFO:         label_file_list : ['/home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/test/test.json']
[2024/10/14 19:20:16] ppocr INFO:         name : SimpleDataSet
[2024/10/14 19:20:16] ppocr INFO:         transforms : 
[2024/10/14 19:20:16] ppocr INFO:             DecodeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 channel_first : False
[2024/10/14 19:20:16] ppocr INFO:                 img_mode : RGB
[2024/10/14 19:20:16] ppocr INFO:             VQATokenLabelEncode : 
[2024/10/14 19:20:16] ppocr INFO:                 algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:                 class_path : train_data/OCR_front/class_list_xfun.txt
[2024/10/14 19:20:16] ppocr INFO:                 contains_re : True
[2024/10/14 19:20:16] ppocr INFO:                 order_method : tb-yx
[2024/10/14 19:20:16] ppocr INFO:                 use_textline_bbox_info : True
[2024/10/14 19:20:16] ppocr INFO:             VQATokenPad : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:                 return_attention_mask : True
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenRelation : None
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenChunk : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:             TensorizeEntitiesRelations : None
[2024/10/14 19:20:16] ppocr INFO:             Resize : 
[2024/10/14 19:20:16] ppocr INFO:                 size : [512, 512]
[2024/10/14 19:20:16] ppocr INFO:             NormalizeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 mean : [128, 128, 128]
[2024/10/14 19:20:16] ppocr INFO:                 order : hwc
[2024/10/14 19:20:16] ppocr INFO:                 scale : 1
[2024/10/14 19:20:16] ppocr INFO:                 std : [60, 60, 60]
[2024/10/14 19:20:16] ppocr INFO:             ToCHWImage : None
[2024/10/14 19:20:16] ppocr INFO:             KeepKeys : 
[2024/10/14 19:20:16] ppocr INFO:                 keep_keys : ['input_ids', 'bbox', 'attention_mask', 'token_type_ids', 'entities', 'relations']
[2024/10/14 19:20:16] ppocr INFO:     loader : 
[2024/10/14 19:20:16] ppocr INFO:         batch_size_per_card : 8
[2024/10/14 19:20:16] ppocr INFO:         drop_last : False
[2024/10/14 19:20:16] ppocr INFO:         num_workers : 8
[2024/10/14 19:20:16] ppocr INFO:         shuffle : False
[2024/10/14 19:20:16] ppocr INFO: Global : 
[2024/10/14 19:20:16] ppocr INFO:     cal_metric_during_train : False
[2024/10/14 19:20:16] ppocr INFO:     epoch_num : 130
[2024/10/14 19:20:16] ppocr INFO:     eval_batch_step : [0, 19]
[2024/10/14 19:20:16] ppocr INFO:     infer_img : /home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/test/image
[2024/10/14 19:20:16] ppocr INFO:     kie_det_model_dir : None
[2024/10/14 19:20:16] ppocr INFO:     kie_rec_model_dir : None
[2024/10/14 19:20:16] ppocr INFO:     log_smooth_window : 10
[2024/10/14 19:20:16] ppocr INFO:     print_batch_step : 10
[2024/10/14 19:20:16] ppocr INFO:     save_epoch_step : 2000
[2024/10/14 19:20:16] ppocr INFO:     save_inference_dir : None
[2024/10/14 19:20:16] ppocr INFO:     save_model_dir : ./output/10_14/RE_front
[2024/10/14 19:20:16] ppocr INFO:     save_res_path : ./output/10_14/re/OCR/with_gt
[2024/10/14 19:20:16] ppocr INFO:     seed : 2022
[2024/10/14 19:20:16] ppocr INFO:     use_gpu : True
[2024/10/14 19:20:16] ppocr INFO:     use_visualdl : False
[2024/10/14 19:20:16] ppocr INFO: Loss : 
[2024/10/14 19:20:16] ppocr INFO:     key : loss
[2024/10/14 19:20:16] ppocr INFO:     name : LossFromOutput
[2024/10/14 19:20:16] ppocr INFO:     reduction : mean
[2024/10/14 19:20:16] ppocr INFO: Metric : 
[2024/10/14 19:20:16] ppocr INFO:     main_indicator : hmean
[2024/10/14 19:20:16] ppocr INFO:     name : VQAReTokenMetric
[2024/10/14 19:20:16] ppocr INFO: Optimizer : 
[2024/10/14 19:20:16] ppocr INFO:     beta1 : 0.9
[2024/10/14 19:20:16] ppocr INFO:     beta2 : 0.999
[2024/10/14 19:20:16] ppocr INFO:     clip_norm : 10
[2024/10/14 19:20:16] ppocr INFO:     lr : 
[2024/10/14 19:20:16] ppocr INFO:         learning_rate : 5e-05
[2024/10/14 19:20:16] ppocr INFO:         warmup_epoch : 10
[2024/10/14 19:20:16] ppocr INFO:     name : AdamW
[2024/10/14 19:20:16] ppocr INFO:     regularizer : 
[2024/10/14 19:20:16] ppocr INFO:         factor : 0.0
[2024/10/14 19:20:16] ppocr INFO:         name : L2
[2024/10/14 19:20:16] ppocr INFO: PostProcess : 
[2024/10/14 19:20:16] ppocr INFO:     name : VQAReTokenLayoutLMPostProcess
[2024/10/14 19:20:16] ppocr INFO: Train : 
[2024/10/14 19:20:16] ppocr INFO:     dataset : 
[2024/10/14 19:20:16] ppocr INFO:         data_dir : /home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/train/image
[2024/10/14 19:20:16] ppocr INFO:         label_file_list : ['/home/user/1_xsz_ocr/PaddleOCR/train_data/OCR_front/train/train.json']
[2024/10/14 19:20:16] ppocr INFO:         name : SimpleDataSet
[2024/10/14 19:20:16] ppocr INFO:         ratio_list : [1.0]
[2024/10/14 19:20:16] ppocr INFO:         transforms : 
[2024/10/14 19:20:16] ppocr INFO:             DecodeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 channel_first : False
[2024/10/14 19:20:16] ppocr INFO:                 img_mode : RGB
[2024/10/14 19:20:16] ppocr INFO:             VQATokenLabelEncode : 
[2024/10/14 19:20:16] ppocr INFO:                 algorithm : LayoutXLM
[2024/10/14 19:20:16] ppocr INFO:                 class_path : train_data/OCR_front/class_list_xfun.txt
[2024/10/14 19:20:16] ppocr INFO:                 contains_re : True
[2024/10/14 19:20:16] ppocr INFO:                 order_method : tb-yx
[2024/10/14 19:20:16] ppocr INFO:                 use_textline_bbox_info : True
[2024/10/14 19:20:16] ppocr INFO:             VQATokenPad : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:                 return_attention_mask : True
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenRelation : None
[2024/10/14 19:20:16] ppocr INFO:             VQAReTokenChunk : 
[2024/10/14 19:20:16] ppocr INFO:                 max_seq_len : 512
[2024/10/14 19:20:16] ppocr INFO:             TensorizeEntitiesRelations : None
[2024/10/14 19:20:16] ppocr INFO:             Resize : 
[2024/10/14 19:20:16] ppocr INFO:                 size : [512, 512]
[2024/10/14 19:20:16] ppocr INFO:             NormalizeImage : 
[2024/10/14 19:20:16] ppocr INFO:                 mean : [128, 128, 128]
[2024/10/14 19:20:16] ppocr INFO:                 order : hwc
[2024/10/14 19:20:16] ppocr INFO:                 scale : 1
[2024/10/14 19:20:16] ppocr INFO:                 std : [60, 60, 60]
[2024/10/14 19:20:16] ppocr INFO:             ToCHWImage : None
[2024/10/14 19:20:16] ppocr INFO:             KeepKeys : 
[2024/10/14 19:20:16] ppocr INFO:                 keep_keys : ['input_ids', 'bbox', 'attention_mask', 'token_type_ids', 'entities', 'relations']
[2024/10/14 19:20:16] ppocr INFO:     loader : 
[2024/10/14 19:20:16] ppocr INFO:         batch_size_per_card : 2
[2024/10/14 19:20:16] ppocr INFO:         drop_last : False
[2024/10/14 19:20:16] ppocr INFO:         num_workers : 4
[2024/10/14 19:20:16] ppocr INFO:         shuffle : True
[2024/10/14 19:20:16] ppocr INFO: train with paddle 3.0.0-beta1 and device Place(gpu:0)
INFO:datasets:PyTorch version 2.4.1 available.
W1014 19:20:18.021422 715557 gpu_resources.cc:119] Please NOTE: device: 0, GPU Compute Capability: 7.0, Driver API Version: 12.2, Runtime API Version: 11.8
W1014 19:20:18.022987 715557 gpu_resources.cc:164] device: 0, cuDNN Version: 8.7.
[2024/10/14 19:20:21] ppocr INFO: resume from ./output/10_14/RE_front/best_accuracy
[2024/10/14 19:20:21] ppocr WARNING: The first GPU is used for inference by default, GPU ID: 0
[2024/10/14 19:20:22] ppocr WARNING: The first GPU is used for inference by default, GPU ID: 0
[2024-10-14 19:20:23,026] [    INFO] - Already cached /home/user/.paddlenlp/models/layoutxlm-base-uncased/sentencepiece.bpe.model
[2024-10-14 19:20:23,614] [    INFO] - tokenizer config file saved in /home/user/.paddlenlp/models/layoutxlm-base-uncased/tokenizer_config.json
[2024-10-14 19:20:23,617] [    INFO] - Special tokens file saved in /home/user/.paddlenlp/models/layoutxlm-base-uncased/special_tokens_map.json
[2024/10/14 19:20:26] ppocr INFO: resume from ./output/10_14/RE_front/best_accuracy
Traceback (most recent call last):
  File "./tools/infer_kie_token_ser_re.py", line 216, in <module>
    result = ser_re_engine(data)
  File "./tools/infer_kie_token_ser_re.py", line 147, in __call__
    ser_results, ser_inputs = self.ser_engine(data)
  File "/home/user/1_xsz_ocr/PaddleOCR/tools/infer_kie_token_ser.py", line 115, in __call__
    batch = transform(data, self.ops)
  File "/home/user/1_xsz_ocr/PaddleOCR/ppocr/data/imaug/__init__.py", line 72, in transform
    data = op(data)
  File "/home/user/1_xsz_ocr/PaddleOCR/ppocr/data/imaug/vqa/token/vqa_token_relation.py", line 31, in __call__
    relations = data["relations"]
KeyError: 'relations'
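One thing visible in the output above is that the "ser config" block is the same as the "re config" block: both come from `RE_front.yml`, both name `LayoutXLMForRe`, and both list `VQAReTokenRelation` under the Eval transforms. A minimal sketch of a sanity check for this follows. It assumes the YAML stores transforms as a list of single-key mappings and that the SER engine builds its ops from `Eval.dataset.transforms`. Treating the three op names below as RE-only is also an assumption, and `re_ops_in` is a hypothetical helper, not part of PaddleOCR.

```python
# Op names copied from the config dump above; treating them as RE-only
# is an assumption made for this sketch.
RE_ONLY_OPS = {"VQAReTokenRelation", "VQAReTokenChunk", "TensorizeEntitiesRelations"}


def transform_names(config):
    """Return the op names listed under Eval.dataset.transforms, in order."""
    names = []
    for entry in config["Eval"]["dataset"]["transforms"]:
        names.extend(entry.keys())
    return names


def re_ops_in(config):
    """Hypothetical helper: list the ops in the config that look RE-specific."""
    return [name for name in transform_names(config) if name in RE_ONLY_OPS]


# Trimmed copy of the "ser config" block printed in the log.
ser_config = {
    "Architecture": {"Backbone": {"name": "LayoutXLMForRe"}},
    "Eval": {
        "dataset": {
            "transforms": [
                {"DecodeImage": {"img_mode": "RGB", "channel_first": False}},
                {"VQATokenLabelEncode": {"contains_re": True, "algorithm": "LayoutXLM"}},
                {"VQATokenPad": {"max_seq_len": 512, "return_attention_mask": True}},
                {"VQAReTokenRelation": None},
                {"VQAReTokenChunk": {"max_seq_len": 512}},
                {"TensorizeEntitiesRelations": None},
            ]
        }
    },
}

found = re_ops_in(ser_config)
backbone = ser_config["Architecture"]["Backbone"]["name"]
print(f"SER config backbone: {backbone}; RE-looking ops: {found}")
```

If the list is non-empty for the config passed via `-c_ser`, the SER stage is being asked to run relation ops on image-only input, which would be consistent with the `KeyError` in the traceback.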

🏃‍♂️ Environment (Runtime environment)

paddlenlp                 2.5.2                    pypi_0    pypi
paddleocr                 2.8.1                    pypi_0    pypi
paddlepaddle-gpu          3.0.0b1                  pypi_0    pypi

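🌰 Minimal Reproducible Example (minimal demo that reproduces the problem)

The full set of commands is below. Training ran without any errors; the error occurs only at the inference stage.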

conda activate ocr_pp

cd /home/user/1_xsz_ocr/PaddleOCR

python3 tools/train.py -c configs/kie/vi_layoutxlm/RE_front.yml

python3 ./tools/infer_kie_token_ser_re.py \
  -c  ./configs/kie/vi_layoutxlm/RE_front.yml\
  -o Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy\
  Global.infer_img=./train_data/OCR_front/test/image\
  -c_ser ./configs/kie/vi_layoutxlm/RE_front.yml \
  -o_ser Architecture.Backbone.checkpoints=./output/10_14/RE_front/best_accuracy

justgoonz, Oct 14 '24 19:10