extract-text topic

List extract-text repositories

PDFR

Stars: 36, Forks: 3

An R package to extract text from PDF files.

twitter-text-php

Stars: 114, Forks: 21

Twitter text processing library (auto-linking and extraction of usernames, lists, and hashtags). Based on the Ruby and Java implementations by Matt Sanford.

OCR

Stars: 28, Forks: 0

A collection of tools for OCR (optical character recognition).

antiword

Stars: 57, Forks: 4

R wrapper for the antiword utility.

pdftron-document-search

Stars: 41, Forks: 12

Build client-side search across multiple documents in your file storage.

google-vision-api-for-ocr-demo

Stars: 24, Forks: 53

A small Python demo that extracts text from images via OCR using the Google Vision API.

Docotic.Pdf.Samples

Stars: 69, Forks: 39

C# and VB.NET samples for the Docotic.Pdf library.

simple_NER

Stars: 42, Forks: 9

Simple rule-based named entity recognition.

tika-text-extract

Stars: 15, Forks: 4

Extract text from a document using Apache Tika.

pdftoroff

Stars: 18, Forks: 1

View PDF files on X11 and the Linux framebuffer; resize PDF files; convert PDF to text, HTML, TeX, or groff.