amazon-textract-textractor
amazon-textract-textractor copied to clipboard
Textractor refactoring
This PR introduces a new way to use Textract and process its output in Python. It provides redesigned APIs for Text, Tables, Forms, Expense and AnalyseID to improve developer productivity, comprehensive documents, examples, tests and much more.
Features include:
- Calling all the Textract APIs (Synchronous and Asynchronous)
- Fuzzy string matching
- Directional finder
- Exporters in various formats (Tables to excel)
Documentation: https://aws-samples.github.io/amazon-textract-textractor/
Functionally, GitHub workflows were already added to recompile the documentation and run tests on merge. New versions are uploaded to PyPI automatically on release.
Aside for the new library, this PR removes the zip/ directory and moves the older Textractor README to the src/ directory.
You can get started by running pip install amazon-textract-textractor.