python-documentai-toolbox
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from...
[Policy Bot](https://github.com/googleapis/repo-automation-bots/tree/main/packages/policy#policy-bot) found one or more issues with this repository.

- [x] Default branch is 'main'
- [x] Branch protection is enabled
- [x] Merge commits disabled
- [x] There...
This PR contains the following updates:

| Package | Type | Update | Change | Age | Confidence |
|---|---|---|---|---|---|
| [actions/checkout](https://redirect.github.com/actions/checkout) | action | major | `v4` -> `v5`...
- Introduced `confidence` property to represent the confidence score of Token detection.
- Added `detected_languages` property to provide a list of detected languages for each Token.
- Updated unit tests...
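To make the two properties above concrete, here is a minimal, self-contained sketch. It does not use the toolbox's own classes: `TokenSketch` is a hypothetical stand-in, and the dict layout (`layout.confidence`, `detectedLanguages[].languageCode`) is an assumption about how a token appears in a Document AI JSON response.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class TokenSketch:
    """Hypothetical wrapper around one token dict (not the toolbox's Token class)."""

    raw: Dict[str, Any]

    @property
    def confidence(self) -> float:
        # Assumption: the detection score is stored under layout.confidence.
        # Falls back to 0.0 when the field is absent.
        return float(self.raw.get("layout", {}).get("confidence", 0.0))

    @property
    def detected_languages(self) -> List[str]:
        # Assumption: each detected language entry carries a languageCode field.
        return [
            lang["languageCode"]
            for lang in self.raw.get("detectedLanguages", [])
            if "languageCode" in lang
        ]


token = TokenSketch(
    raw={
        "layout": {"confidence": 0.98},
        "detectedLanguages": [{"languageCode": "en", "confidence": 0.9}],
    }
)
print(token.confidence, token.detected_languages)
```

Exposing the values as read-only properties keeps the raw response untouched while giving callers a flat, typed view of each token.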
Thanks for stopping by to let us know something could be better! **PLEASE READ**: If you have a support contract with Google, please create an issue in the [support console](https://cloud.google.com/support/)...
See the samples test failure in PR https://github.com/googleapis/python-documentai-toolbox/pull/380

https://github.com/googleapis/python-documentai-toolbox/blob/0e59848a43b486fd61e022e48f08b93104076eb0/google/cloud/documentai_toolbox/utilities/gcs_utilities.py#L145

https://btx.cloud.google.com/invocations/41ddcb73-85dd-4eda-80d5-052e7c29805a/log

```
=================================== FAILURES ===================================
________________________ test_quickstart_sample_gcs_uri ________________________

capsys =

    def test_quickstart_sample_gcs_uri(capsys: pytest.CaptureFixture) -> None:
        gcs_uri = (
            "gs://documentai_toolbox_samples/output/123456789/0/toolbox_invoice_test-0.json"
        )
>...
```
There is a vulnerability in pyarrow
:robot: I have created a release *beep* *boop*

---

## [0.14.3-alpha](https://github.com/googleapis/python-documentai-toolbox/compare/v0.14.2-alpha...v0.14.3-alpha) (2025-03-14)

### Bug Fixes

* Remove setup.cfg configuration for creating universal wheels ([#383](https://github.com/googleapis/python-documentai-toolbox/issues/383)) ([b1f3772](https://github.com/googleapis/python-documentai-toolbox/commit/b1f3772dd2f7fa76fc2bb8621282287f2066d644))

---

This PR was generated...
I'm trying to use Document AI to generate a searchable PDF out of input documents. Given the marketing around Document AI and the availability of a pretrained "Document OCR"...
The provided conversion script (https://github.com/googleapis/python-documentai-toolbox/blob/d29ff95742269a95e1e96e047f0fa1268457292a/samples/snippets/convert_external_annotations_sample.py) seems to only support single-page documents. Uploading multi-page documents results in all bounding boxes appearing on the first page in the DocumentAI UI, despite correct...
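As an illustration of the behaviour the report expects, the sketch below groups bounding boxes by the page they belong to instead of attaching them all to the first page. It is not taken from the sample script: `Box`, its `page` field, and `group_boxes_by_page` are hypothetical names, and the sketch assumes the external annotations record a zero-based page index for each box.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple


class Box(NamedTuple):
    """Hypothetical annotation record; field names are illustrative only."""

    page: int  # zero-based page index (assumed to be present in the input)
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def group_boxes_by_page(boxes: List[Box]) -> Dict[int, List[Box]]:
    """Return a mapping from page index to the boxes that sit on that page."""
    grouped: Dict[int, List[Box]] = defaultdict(list)
    for box in boxes:
        grouped[box.page].append(box)
    return dict(grouped)


boxes = [
    Box(0, "invoice_id", 0.1, 0.1, 0.3, 0.2),
    Box(1, "total", 0.6, 0.8, 0.9, 0.9),
    Box(1, "due_date", 0.1, 0.8, 0.4, 0.9),
]
by_page = group_boxes_by_page(boxes)
for page_index in sorted(by_page):
    print(page_index, [b.label for b in by_page[page_index]])
```

A converter built along these lines would attach each group to its own page reference when building the output document, rather than defaulting every entity to page 0.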