mmocr SVTR复现问题

SVTR复现问题

Open Topdu opened this issue 1 year ago • 2 comments

Model/Dataset/Scheduler description

非常感谢MMOCR收录SVTR！目前复现结果与论文存在不小的差距主要存在以下的问题： 1、数据集 SVTR使用与ABINet同样的数据集 2 、数据增强 MMOCR复现SVTR使用的数据增强与原论文使用的数据增强存在较大的diff，这也是造成结果差距大的主要原因 3、学习率和Batchsize SVTR默认使用4卡GPU训练，单个GPU 的batchsize为512，总的batchsize为2048，对应的学习率为0.0005 4、优化器的weight decay SVTR原代码训练时，在PaddleOCR中使用了

  no_weight_decay_name: norm pos_embed
  one_dim_param_no_weight_decay: true

以上AdamW优化的参数设置weight decay

其他有关SVTR训练细节欢迎在PaddleOCR新建issue讨论～

Open source status

[ ] The model implementation is available
[ ] The model weights are available.

Provide useful links for the implementation

No response

Jan 15 '23 05:01 Topdu

mmocr mmocr copied to clipboard

SVTR复现问题

Model/Dataset/Scheduler description

Open source status

Provide useful links for the implementation

mmocr
mmocr copied to clipboard