chamanti_ocr_theano
chamanti_ocr_theano copied to clipboard
సొంత డేటా తో ట్రెయిన్ చెయ్యడం
_/_
నేను నా సొంత డేటా తో దీన్ని ట్రెయిన్ చేసి వాడొచ్చా?
ధన్యవాదాలు
మీరు అచ్చతెనుగులో అడిగారు కాబట్టి, నేను స్వచ్ఛాంగ్లాన ఒక పీడీఎఫ్ఫు అప్పలోడు చేస్తాను. రేపో మాపో. నెనర్లు రాకేశ్వర.
@gayatriMahesh What kind of data do you have? Right now I upgraded the system, so that it will generate the data on the fly using the scribe.py file. If you have sentences and corresponding labels, you can replace scribe.py to give one sample each time. That's it.
@gayatriMahesh If possible, can you share training data?
I have multiple volumes of bhagavatam which I wanted to digitize. If it works correctly there are other non profit organizations who want to use the same for some other old telugu texts. So I was trying to understand the end to end process. Right now I have some images.. but are not manually annotated yet
I have started a site to collect and digitize all telugu books. If you have any Telugu books, you can share them.
Hi,
I do have many old historic books like puranas ,upapuranas,ithihasa etc.., I am trying to digitize them, I initially tried tesseract with no luck,now iam trying to use ocropus engine,,,I need language model of telugu it would be great if you can help..Thank you.