alpr-unconstrained
alpr-unconstrained copied to clipboard
problems about ocr dataset
Hello, I'm now trying to train the ocr model using my dataset, but the trained model just doesn't work. I want to know if my annotations are right. Every LP image has several bounding boxes. There's an example:
- class_id x y w h 33 0.0970149253731 0.443069306931 0.0985074626866 0.509900990099 16 0.222388059701 0.457920792079 0.10447761194 0.490099009901 9 0.361194029851 0.457920792079 0.113432835821 0.490099009901 32 0.597014925373 0.487623762376 0.107462686567 0.519801980198 2 0.741791044776 0.502475247525 0.110447761194 0.509900990099 16 0.877611940299 0.512376237624 0.101492537313 0.509900990099 I just use darknet to train your ocr network, so could you give me some hints? Plus, can you tell me how many training LP images you used for ocr network?
You need to put the txt file and the picture in the same directory for training. @heuDavidJia
Take a look at #19 , even with correct labels you will not be able to train the network with Darknet as is. You need to modify the data augmentation part to be less aggressive.
@sergiomsilva Thanks for replying. I will try to modify the data augmentation part.
@linzhi123 I guess the directory of dataset is right, but there are some problems with my data. Have you trained the model successfully?
What do you think is wrong with the data? It's okay for me to use the author's network to train the model, and I only used 1,000 pictures to train, and the effect is good.
@linzhi123 I guess i have find the problem, the annotation tool i used has a big bug. Do you train the network with Darknet directly or with modified data augmentation part?
Direct use, no modification
@linzhi123 Have you changed the anchors for your own data? And should i resize my data to 240*80 before training?Cause i always think the darknet has completed the resize work.
@linzhi123 are you using actual images or synthetically generated? I started training it on synthetic data, the accuracy on val and test data is about 99% but on real data it's not not giving good accuracy, and it's not even robust on one frame it's giving different output and on another it's different of the same plate. while after 10k iterations IOU is about 0.9, class score id 0.99, objectness is 0.9 and recall 1.0. Any hint and suggestion will be appreciated.
I did not change the data to a fixed size @heuDavidJia
Of the 1,000 images I used, 100 were real and 900 were composite. @danishansari
I want to run this code on video please guide me on how to implement it...
@linzhi123, could you provide your train data (1,000 pictures to train),thanks.
Of the 1,000 images I used, 100 were real and 900 were composite. @danishansari
Hi,What method do you use to mark Chinese license plates?I have some problems ,Thank you
我没有将数据更改为固定大小@heuDavidJia
Which project are you based on to implement OCR, can you give a link