unilm icon indicating copy to clipboard operation
unilm copied to clipboard

[TrOCR] How to train TRocr on custom dataset of different language

Open kasuba-badri-vishal opened this issue 2 years ago • 1 comments

Hi, I want to change the Decoder part of TRocr to train and infer on different vocabulary [i.e different language]. I was following sample implementation from here but I was not able to change the vocabulary but just the size of vocabulary. It would be really helpful to know how can I change vocabulary for the TRocr decoder part.

Thanks

kasuba-badri-vishal avatar Oct 06 '22 08:10 kasuba-badri-vishal

+1

Mohammed20201991 avatar Dec 01 '22 16:12 Mohammed20201991

hope to get some feedback.

bit-scientist avatar Aug 29 '23 08:08 bit-scientist