extract-text topic
PDFR
An R package to extract text from pdf.
twitter-text-php
Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). Based on the Ruby and Java implementations by Matt Sanford
OCR
A collection of tools for OCR (optical character recognition).
pdftron-document-search
Build search across multiple documents client-side in your file storage
google-vision-api-for-ocr-demo
Repo which contains a small demo to Extract Text from image OCR using Google Vision API in Python
Docotic.Pdf.Samples
C# and VB.NET samples for Docotic.Pdf library
simple_NER
simple rule based named entity recognition
tika-text-extract
Extract text from a document by Apache Tika
pdftoroff
view pdf on X11 and the Linux framebuffer; resize pdf; convert pdf to text, html, TeX, groff