MaskTextSpotter train for long text line detection and recognition

Open JimmyJuan opened this issue 4 years ago • 4 comments

Hello @MhLiao, my scenario mostly involves long text lines. Should I retrain the model after widening the parameter "MODEL.ROI_MASK_HEAD.POOLER_RESOLUTION_W" from 64 to, say, 128 or 196 (and changing MODEL.ROI_MASK_HEAD.RESOLUTION_W from 128 to 256/384 accordingly)?

If so, should I train a new model from scratch with the new settings? How should I do it?

Thank you.

Jun 04 '20 09:06 JimmyJuan

@JimmyJuan Yes, you need to change the settings you mentioned and retrain the model. If you do not have enough training data, you should use SynthText for pretraining.

Jun 15 '20 03:06 MhLiao
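
For readers following along, here is a minimal sketch of how the two settings named above could be kept consistent when widening the mask head. The two config keys are the ones quoted in the question. The helper name `wide_mask_head_overrides` is hypothetical and not part of MaskTextSpotter, and the 2x ratio between the pooler width and the output width is an assumption inferred only from the numbers in the question (64 to 128, 128 to 256). Under that assumed ratio, an output width of 384 corresponds to a pooler width of 192 rather than 196.

```python
# Hypothetical helper, not part of MaskTextSpotter.
# Assumption: RESOLUTION_W is always 2x POOLER_RESOLUTION_W, as suggested
# by the pairs quoted in the question (64 -> 128, 128 -> 256).
def wide_mask_head_overrides(pooler_w, upsample=2):
    """Return a flat [KEY, VALUE, KEY, VALUE] list of config overrides."""
    if pooler_w <= 0:
        raise ValueError("pooler_w must be a positive integer")
    return [
        "MODEL.ROI_MASK_HEAD.POOLER_RESOLUTION_W", pooler_w,
        "MODEL.ROI_MASK_HEAD.RESOLUTION_W", pooler_w * upsample,
    ]


# Print the override pairs for the default width and two wider candidates.
for width in (64, 128, 192):
    print(wide_mask_head_overrides(width))
```

A flat key/value list is the shape that a yacs-style `merge_from_list` call accepts. Whether this repository's config is loaded that way is an assumption to verify against its training script. The same values can also be written directly into the YAML config file before retraining.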

@MhLiao Thank you.

I'm training a new pretrained model using only SynthText data. The loss started as low as about 0.6, then went up to 1.5, and has gradually decreased since. Is that normal? Also, for the detection branch, the scale of the characters in SynthText differs from my real data. Is SynthText pretraining still effective in that case?

Jun 15 '20 03:06 JimmyJuan

@JimmyJuan The loss is normal as long as it gradually decreases. It rises at first because there are almost no positive proposals at the beginning of training, and the number of positive proposals then increases. I think that pretraining may not be as effective as expected, but it should not harm the training.

Jun 15 '20 03:06 MhLiao

@MhLiao Thanks for the reply.

Jun 15 '20 05:06 JimmyJuan
