Shrinivasan T

Results 66 comments of Shrinivasan T

Hmm. I still do not understand what purge is or how to do it diagrammatically. I will explore this and comment here later.

I get a similar issue with this file too.

shrinivasan@shrinivasan-laptop:~/dev/wiki/wiki2ocr-testing/test8$ python do_ocr.py
INFO:__main__:Running do_ocr.py 1.53
INFO:root:Operating System = "Ubuntu 15.04"
INFO:__main__:URL = https://upload.wikimedia.org/wikipedia/commons/e/ea/%E0%A6%AC%E0%A6%BF%E0%A6%B6%E0%A7%8D%E0%A6%AC%E0%A6%95%E0%A7%8B%E0%A6%B7_%E0%A6%B7%E0%A6%B7%E0%A7%8D%E0%A6%A0_%E0%A6%96%E0%A6%A3%E0%A7%8D%E0%A6%A1.djvu
INFO:__main__:Columns = 1
INFO:__main__:Wiki Username = Tshrinivasan
INFO:__main__:Wiki...

This looks like an issue with Google connectivity. Is this happening for all files? Are the same files uploaded after some time?

I tried with version 1.32 and got 100 files, text_for_page_00001.txt through text_for_page_00100.txt. Try once again and share the results.
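When comparing runs, it can help to confirm that every per-page file was actually written. The following is only a minimal sketch: it assumes the output files sit in a single directory and follow the text_for_page_NNNNN.txt pattern mentioned above. The helper name missing_pages is hypothetical and is not part of do_ocr.py.

```python
from pathlib import Path


def missing_pages(output_dir, total_pages):
    """Return the expected per-page text files that are absent from output_dir.

    Assumes the naming pattern text_for_page_00001.txt, text_for_page_00002.txt, ...
    (five-digit, zero-padded, starting at 1), as seen in the comment above.
    """
    out = Path(output_dir)
    expected = [f"text_for_page_{n:05d}.txt" for n in range(1, total_pages + 1)]
    return [name for name in expected if not (out / name).is_file()]


if __name__ == "__main__":
    # Check the current directory for a 100-page document.
    gaps = missing_pages(".", 100)
    print("all pages present" if not gaps else f"missing: {gaps}")
```

Sharing the list of missing file names along with the log makes it easier to see whether the failure hits the same pages on every run.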

Can anyone create a wiki page for this with the required details and sample data? I can extend that page with the OCR operation details. We can use the same wiki user...

Is there any length limit in the wiki for such tabular data? How can we add pagination? Or is pagination already available for lengthy wiki pages?

This is on the future roadmap. Once the OCR runs without a single issue, we can automate it completely. For now, we have to verify for script to run completely...

Please explain in detail, with an example. Are you updating an existing page with the mediawiki_uploader script? It just pastes the content from the OCR. It does not check for any existing...

The script does not add any header or footer. If you see that it adds them, share some example pages so we can compare and analyze them.

The setup takes time when we want people to use their own Gmail drive account for OCR. To simplify this, I will create a Gmail account, set it up for the API, and share...