textract topic
aws-tutorial-code
AWS tutorial code.
doc2audiobook
Convert text documents to high fidelity audio(books).
code4goal-resume-parser
Solution for Code4Goal challenge
aws-pdf-textract-pipeline
:mag: Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Textractor
一个高效的从HTML中提取正文的类库。An efficient class library for extracting text from HTML.
wagtail_textract
Text extraction for Wagtail document search
ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
quipucamayoc
dev repo for article