gcv2hocr icon indicating copy to clipboard operation
gcv2hocr copied to clipboard

gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.

Results 17 gcv2hocr issues
Sort by recently updated
recently updated
newest added

I had to manually specify the page_width and page_height to match my PDF images to get the words to align. I am sure the words are perfectly aligned by manually...

Currently in `gcv2hocr2.py`, the coordinate of the bounding box for `block`, `paragraph`, and `word` is created from their respective `boundingBox.normalizedVertices`: https://github.com/dinosauria123/gcv2hocr/blob/40adc1026fc10a0fbe746a0a26329d0e9bcd527a/gcv2hocr2.py#L123 https://github.com/dinosauria123/gcv2hocr/blob/40adc1026fc10a0fbe746a0a26329d0e9bcd527a/gcv2hocr2.py#L129 https://github.com/dinosauria123/gcv2hocr/blob/40adc1026fc10a0fbe746a0a26329d0e9bcd527a/gcv2hocr2.py#L135 Is it possible to create a new...

I have gone ahead and updated the script gcvocr.sh in order to accommodate for multi-threading. You can go ahead and view the source code here. https://gist.github.com/UBISOFT-1/4017d641c329159f8de3d203efc919e1 I am adding the...

According to the hOCR standard (Latest is v1.2 as of March 2021), the bbox property specifies `uint` to be used. That means all values must be unsigned. ([http://kba.cloud/hocr-spec/1.2/#propdef-bbox](http://kba.cloud/hocr-spec/1.2/#propdef-bbox)) However, the...

I'm working with the attached JSON file from GCV but when I run the gcv2ocr.py, the hocr only has metadata and lacks content. [osh-sample-1911a-0001.json.zip](https://github.com/dinosauria123/gcv2hocr/files/4689613/osh-sample-1911a-0001.json.zip)

When posting OCR request, we can choose two type of response. A TEXT_DETECTION response includes the detected phrase, its bounding box, and individual words and their bounding boxes: A DOCUMENT_TEXT_DETECTION...

Hi @dinosauria123! This is the issue I posted on ocr-fileformat: https://github.com/UB-Mannheim/ocr-fileformat/issues/121 As per your request I'm opening the issue here, copying the text: I have the JSON output of google...

1. save sample/jpn/jptest2.jpg as jptest2.pdf, 2. uploading to google vision (storage), and 3. generate output.json with `gcloud ml vision detect-text-pdf gs://my_bucket/input_file gs://my_bucket/out_put_prefix`, according to [text_detection_pdf](https://cloud.google.com/vision/docs/pdf#vision_text_detection_pdf_gcs-gcloud) 4. download output.json 4. gcv2hocr...

First off, thanks for an awesome piece of software. For the most part, it works great! For some reason, after converting many thousands of pages, I've come across this error...

I tried to convert the json output on Google's page using gcv2hocr.py: https://cloud.google.com/vision/docs/ocr Traceback (most recent call last): File "gcv2hocr2.py", line 146, in page = fromResponse(resp, **args.__dict__) File "gcv2hocr.py", line...