PaddleOCR PPOCRV3文本识别精度低

请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem

系统环境/System Environment：windows
版本号/Version：Paddle：release/2.6
PaddleOCR：问题相关组件/Related components：ppocr_v3_rec
运行指令/Command Code：
python tools/infer/predict_system.py --image_dir="******" --det_model_dir="./output/save_det_inference/" --rec_model_dir="./output/save_rec_inference/Student" --rec_image_shape="3, 48, 320" --rec_char_dict_path="./ppocr/utils/ppocr_keys_v1.txt"
完整报错/Complete Error Message：

运用PPOCRV3并官方提供的识别模型，识别精度较低，基本无法使用。如下： 2019-02-19 2019-82-19 轻型平板货车轻型平板贷车

有没有什么精度更高的推理模型，或者有什么其他的提高精度的解决方案？

Oct 11 '22 07:10 engrwuang

如果推理图片只有文本行内容的话，不需要使用检测模型，会影响文本识别效果，建议尝试如下命令：

python tools/infer/predict_rec.py --image_dir="******" --rec_model_dir="./output/save_rec_inference/Student" --rec_image_shape="3, 48, 320" --rec_char_dict_path="./ppocr/utils/ppocr_keys_v1.txt"

Oct 11 '22 08:10 tink2123

如果推理图片只有文本行内容的话，不需要使用检测模型，会影响文本识别效果，建议尝试如下命令：
python tools/infer/predict_rec.py --image_dir="******" --rec_model_dir="./output/save_rec_inference/Student" --rec_image_shape="3, 48, 320" --rec_char_dict_path="./ppocr/utils/ppocr_keys_v1.txt"

我这边做的不只是文本行内容，需要检测+识别模型，但是官方提供的识别精度太低，识别效果较差，所以单独提出识别问题。

Oct 11 '22 08:10 engrwuang

这个中间带线的0确实识别效果很差，容易识别成6、8、9，车辆相关的证件上大部分是这样的0出现，如驾驶证/行驶证/车辆登记证等

Oct 12 '22 09:10 746891300-org

同问，驾驶证/行驶证/车辆登记证识别的精度很差，有什么解决方案吗？@tink2123

Oct 12 '22 12:10 guishilike

这个中间带线的0确实识别效果很差，容易识别成6、8、9，车辆相关的证件上大部分是这样的0出现，如驾驶证/行驶证/车辆登记证等

请问这种情况有好一些的解决方案吗？

Oct 17 '22 08:10 engrwuang

同问，驾驶证/行驶证/车辆登记证识别的精度很差，有什么解决方案吗？@tink2123

训练数据中缺少这种数据，可以使用 PaddleOCRLabel 标注一批数据，使用小数据集finetune一下，应该可以改善。

Oct 24 '22 06:10 tink2123

是的，finetune后效果好了一些

Oct 24 '22 07:10 guishilike

PaddleOCR PaddleOCR copied to clipboard

PPOCRV3文本识别精度低

PaddleOCR
PaddleOCR copied to clipboard