MaskTextSpotter icon indicating copy to clipboard operation
MaskTextSpotter copied to clipboard

train for long text line detection and recognition

Open JimmyJuan opened this issue 4 years ago • 4 comments

Hello @MhLiao , my scenario is mostly long text line included. Should I retrain the model by changing the parameter "MODEL.ROI_MASK_HEAD.POOLER_RESOLUTION_W" from 64 to wider, say 128 or 196 (with the change of MODEL.ROI_MASK_HEAD.RESOLUTION_W from 128 to 256/384).

If so, should I train a new model from scratch with new setting? How to do it ?

Thank you.

JimmyJuan avatar Jun 04 '20 09:06 JimmyJuan

@JimmyJuan Yes, you need to change the mentioned setting and retrain it. If you do not have enough training data, you'd better use SynthText for pretraining.

MhLiao avatar Jun 15 '20 03:06 MhLiao

@MhLiao Thank you.

I'm training a new pre-trained model using only SynthText data. But the loss at the beginning was as low as about 0.6, and then went up to 1.5, and gradually decrease. Is that normal? And for the detection branch, the scale of the characters was different from my real data, is the SynthText pretraining still effective?

JimmyJuan avatar Jun 15 '20 03:06 JimmyJuan

@JimmyJuan The loss should be normal if it gradually decreases. It goes up to a high value because the positive proposals increase while there are almost no positive proposals at the beginning. I think that pretraining may be not as effective as expected, but it should not harm the training.

MhLiao avatar Jun 15 '20 03:06 MhLiao

@MhLiao Thanks for the replying.

JimmyJuan avatar Jun 15 '20 05:06 JimmyJuan