amazon-textract-textractor icon indicating copy to clipboard operation
amazon-textract-textractor copied to clipboard

Analyze documents with Amazon Textract and generate output in multiple formats.

Results 129 amazon-textract-textractor issues
Sort by recently updated
recently updated
newest added

I can see support for exporting to csv, Pandas DataFrame and xls here: https://aws-samples.github.io/amazon-textract-textractor/notebooks/table_data_to_various_formats.html Is there a way to export the table data to json? Or do I have to...

*Issue #, if available:* [#165](https://github.com/aws-samples/amazon-textract-textractor/issues/165) *Description of changes:* Previously, customers had the ability to convert the API response JSON to Document object to edit its contents. Support has now been...

*Issue #, if available:* #170 *Description of changes:* The original issue was that word and line bounding boxes were shifted in some cases when page width or page height !=...

start_document_analysis in the documentation says it supports a list of PIL images, but in the source code https://github.com/aws-samples/amazon-textract-textractor/blob/e40f5b0378f9ee24d0a757de414505fb06a4471f/textractor/textractor.py#L488 it only accepts a string, a bytearray, or a PIL Image. How...

enhancement

Currently there is limited support for AnalyzeExpense in Textractor. We support sync and async API calls. However we need to implement the following: - [x] Allow duplication of KV for...

As a prior to the geofinder feature, we need the ability to export edited entities back into the Textract API response format. In the image below, the customer would be...

Current Implementation: Task: - [ ] Get feature requirements from stakeholders

The conversion to trp2 is based on using the initial response. This does not capture the any modifications made to the entities like OCR post-processing or correction or deletion of...

enhancement

Image shows multi-column text for which the Textract returns words with bounding box information. Aim: Support export/pretty print retaining the spaces shown in the document i.e print digital text in...

It is often possible to improve results of the final processing by performing adjustements on the input image. We want to provide a helper library such that it is easy...