PaddleOCR issues

印章弯曲文本识别

12

![1](https://user-images.githubusercontent.com/113701415/192666315-0da15968-528e-4784-a2ef-a231d5da8710.JPG) ![2](https://user-images.githubusercontent.com/113701415/192666319-df2d94d6-2a7c-48ea-951d-7467ab356d0c.JPG) 小批量数据使用det_r50_db++_icdar15.yml，进行训练（使用的是ResNet50_dcn_asf_synthtext_pretrained.pdparams预训练模型），loss最终只能降到0.4左右，拿训练集直接进行评估的，recall和precision一直低于0.1，在配置文件部分也加了use_polygon: true该参数,标注格式如下(弯曲部分均采用16点或8点均匀标注，倾斜文本采用4点标注），想问一下如何解决该问题，完全达不到官方教程里光滑识别区域的效果。 ![3](https://user-images.githubusercontent.com/113701415/192667497-f1267d65-725a-44fc-ab69-f26bd86aa33e.JPG)

mingming0611

fix incompatibility with old ocr function

2

兼容 #7834 修改前的 ocr 函数：当参数 img 为非数组时返回非数组的结果。

YongJie-Xie

contributor

增值税发票文本检测，基于PPOCRv3轻量检测模型的finetune训练，训练精度非常低

5

场景介绍：增值税发票文本检测 1、首先使用ppocrLabel标注工具对数据进行标注（自动标注后，主要是将一些文字间距较大的字段的标注做修改，多个文本区域进行合并）； ![BD2210AB878C0AFD1FCE5975C011354A](https://user-images.githubusercontent.com/41052224/196595396-8207a086-a3f0-4c7c-8f6d-7d35e24ae765.png) 2、基于PPOCRv3轻量检测模型的finetune训练，训练精度非常低 - 系统环境/System Environment：ubuntu18.4，python3.7 - 版本号/Version：Paddle：2.2.0-gpu 使用ch_PP-OCRv3_det_distill_train的student预训练模型进行finetune - 运行指令： python3 -m paddle.distributed.launch --gpus '0,1,2,3' tools/train.py -c configs/det/ch_PP-OCRv3/ch_PP-OCRv3_det_student.yml \ -o Global.pretrained_model=/home/models/pretrained_models/ch_PP-OCRv3_det_distill_train/student \ Global.save_model_dir=/home/models/train_models/det_models/ppv3 - 问题：训练精度随epoch增长逐渐降低 ![1297EE89D1D1F9D24C28A6B94CDC2541](https://user-images.githubusercontent.com/41052224/196595692-35f812e0-3d90-41f1-aa66-09ae6e5865f2.png) ![E880B4583B8C3E534684AECC5A374104](https://user-images.githubusercontent.com/41052224/196595708-b20d3f98-5095-4b61-93eb-8d89daebd874.png) ![2F12569671AAB6EBE599BFDA5BA48C1D](https://user-images.githubusercontent.com/41052224/196595913-67ada3c1-1db1-4c2a-8cf9-f38e27e927df.png)...

sybest1259

Can't extract tar file in ch_ppocr_mobile_v2.0_cls_infer.tar

1

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem - 系统环境/System Environment：Ubuntu - 版本号/Version：Paddle： PaddleOCR：问题相关组件/Related components： - 运行指令/Command Code： - 完整报错/Complete Error Message：I can't extract Text Angle Classification...

monkiravn

use custom trained layout analysis model

4

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem - 系统环境/System Environment： ubuntu 18 - 版本号/Version： Paddle：2.3.0 PaddleOCR：问题相关组件/Related components： 2.6.0.1 - 运行指令/Command Code： - from paddleocr import PPStructure...

Mahmuod1

识别模型微调finetune相关的问题

1

训练V3版本是文字识别模型。场景是身份证。当前下载了部分开源的中文识别的数据集有200多万张。自己生成的中文数据集有40万张左右，标注好的真实身份证的图片有10万张左右。用V3官方提供的中文推理模型测试身份证，有些不清晰的识别效果不好。因此使用了真实图片进行标注。想使用ch_PP-OCRv3_rec_train/best_accuracy.pdparams 预训练模型进行finetune。请问：：使用官方的这个中文识别训练模型微调，应该是用以下那两种方法好呢？ 1. 加入我找的这200万张开源中文数据集，然后将 ratio_list：设置成真实+生成和开源数据 1：1 2. 不需要加入我找的开源数据，直接使用生成的数据和真实数据进行finetune

ainndejj11

recognition

SVTR配置文件

1

请问SVTR系列的small base large有配置文件，我看到只提供了tiny的，没有其他三个，如果我export_model时直接用tiny的配置文件导出其他三个模型会提示模型和预训练参数不一致。

peiwenYe

车牌检测这类，需要用SVTR嘛

4

车牌这类基本都是没有什么上下文的吧，下一个字符是什么完全不可预测。像RNN或是SVTR这类真的有必要嘛，看intel做的LPRNet用了个简单的13*1的卷积直接代替了。有用Paddle实现LPRNet的嘛

nistarlwc

弯曲中文文字识别验证一直为0

7

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem - 系统环境/System Environment：Ubuntu 18.04.5 - 版本号/Version：Paddle：gpu-2.3.0.post112 PaddleOCR：2.6.0.1 - 问题相关组件/Related components：端到端弯曲印章中文文本识别- ![image](https://user-images.githubusercontent.com/44218061/196322666-cf273203-2bbd-48da-8a33-7c2836576d54.png) - 运行指令/Command Code：python tools/train.py -c configs/e2e/e2e_r50_vd_pg.yml - 完整报错/Complete...

lai-serena

修复多进程下ModuleNotFoundError: No module named 'paddleocr.ppstructure'的问题

1

原有实现中，同名Module（paddleocr）下使用同名的Python主入口脚本（paddleocr.py），使用spawn多进程，在Python3.8触发ModuleNotFoundError的bug，该bug在测试环境（Python3.8.15 Paddle2.2.2 PaddleOCR2.6.0.1）可以稳定复现 CASE代码： ```python from paddleocr.ppstructure.predict_system import StructureSystem import multiprocessing def func( pid, ): print(f"[{pid}] Start") [_ for _ in range(int(1e6))] print(f"[{pid}] Finished") if __name__ == '__main__': with multiprocessing.Manager()...

hermitgreen

contributor

status: proposed

PaddleOCR
PaddleOCR copied to clipboard

Metadata

印章弯曲文本识别

fix incompatibility with old ocr function

增值税发票文本检测，基于PPOCRv3轻量检测模型的finetune训练，训练精度非常低

Can't extract tar file in ch_ppocr_mobile_v2.0_cls_infer.tar

use custom trained layout analysis model

识别模型微调finetune相关的问题

SVTR配置文件

车牌检测这类，需要用SVTR嘛

弯曲中文文字识别验证一直为0

修复多进程下ModuleNotFoundError: No module named 'paddleocr.ppstructure'的问题

← Metadata

Owner

Metadata

PaddleOCR PaddleOCR copied to clipboard

Metadata

← Metadata

Owner

Metadata

PaddleOCR
PaddleOCR copied to clipboard