TextBoxes Some questions about learning process

Some questions about learning process

Open seovchinnikov opened this issue 7 years ago • 2 comments

Hey! Thank you for your paper. I would like to ask a couple of questions:

What is magnitude of the corresponding train/val loss in the end of the training on icdar13 when fscore of ~ 0.8 is reached?
Does it matter how to name the text class if its the only one in xmls?
How to debug caffe's image generator? I mean to look at warped images, for example

Thank you in advance

Jan 19 '18 07:01 seovchinnikov

You can watch the detection_eval to get a good model.
It should match the class in your "labelmap" file
I am sorry that I have no idea.

Jan 19 '18 11:01 MhLiao

Thank you!

Im trying to finetune on my small dataset ~ 1000 pictures but got detection_eval 0.11 after 500 iterations (iter_size=8, batch=4) and it does not get better after 1000... It seems like smth wrong with my data or augmentation for this data... I have vertical images about 128px width and 350px+-50px height with text lines aligned in the center of image. It seems like I need to play around with batch_sampler. I think this one is more appropriate for my case:

'sampler': {
                        'min_scale': 0.6,
                        'max_scale': 1.0,
                        'min_aspect_ratio': 0.2,
                        'max_aspect_ratio': 0.7,
},

By the way, does it work with negative images without any text? I put it do my dataset as well. Is it ok?

Jan 19 '18 13:01 seovchinnikov

TextBoxes TextBoxes copied to clipboard

Some questions about learning process

TextBoxes
TextBoxes copied to clipboard