amazon-textract-textractor issues

Rotated documents are not visualized correctly

I suggest that as a starting point we: - Use the polygon in order to visualize the text bounding box, print the text horizontally

ThomasDelteil

bug

Improve Table visualization

We want to improve visualization and support of Tables features: - [ ] Header cells

ThomasDelteil

Improve AnalyzeID support

In order to improve AnalyzeID support we want to: - [ ] Improve visualization to show-case the summary fields - [ ] Improve support of summary fields such that they...

ThomasDelteil

Tables exported to Excel with Textractor display an error when opened in Excel

When exporting the API output of Textract Tables, an error will be shown when opening the resulting `.xlsx` file in Microsoft Excel.

Belval

bug

Add support for AnalyzeLending in Textractor

https://github.com/aws-samples/amazon-textract-textractor/issues/134 was merged and the underlying caller now support AnalyzeLending. We need to add it to Textractor.

Belval

enhancement

Improve Table Indexing

There is some limited support for table indexing such as: ``` new_table = document.tables[0][:5, :] ``` In order to select the first 5 rows of a given table. However we...

ThomasDelteil

Remove calls to Textract from prettyprinter tests

Tests for prettyprinter call Textract directly instead of using JSON ``` def test_pretty_with_tables(): features = [Textract_Features.FORMS, Textract_Features.TABLES] textract_client = boto3.client('textract', region_name='us-east-2') response = call_textract(input_document="s3://amazon-textract-public-content/blogs/w2-example.png", features=features, boto3_textract_client=textract_client) assert response tables_result =...

schadem

Move Integration Testing to general account

Move the integration testing for the caller and textractor to a different account than 913165245630

schadem

TGeoFinder should have a method to reset the in memory sqlite database

1

Hi @Belval 1. I find out that every time when you create the [TGeoFinder](https://github.com/aws-samples/amazon-textract-textractor/blob/master/tpipelinegeofinder/textractgeofinder/tgeofinder.py#L51) class from the JSON data, you actually generate a uuid for this object and insert lot...

MacHu-GWU

Visualize document.expense_documents

It would be great if we could visualize expense_documents and the associated normalized summary fields directly on the document as well, similarly as to how we currently visualize KV containers...

ThomasDelteil

enhancement

amazon-textract-textractor
amazon-textract-textractor copied to clipboard

Metadata

Rotated documents are not visualized correctly

Improve Table visualization

Improve AnalyzeID support

Tables exported to Excel with Textractor display an error when opened in Excel

Add support for AnalyzeLending in Textractor

Improve Table Indexing

Remove calls to Textract from prettyprinter tests

Move Integration Testing to general account

TGeoFinder should have a method to reset the in memory sqlite database

Visualize document.expense_documents

← Metadata

Owner

Metadata

amazon-textract-textractor amazon-textract-textractor copied to clipboard

Metadata

← Metadata

Owner

Metadata

amazon-textract-textractor
amazon-textract-textractor copied to clipboard