data-extraction topic
sypht-python-client
A python client for the Sypht API
sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
line-segmentation-algorithm-to-gcp-vision
Line segmentation algorithm for Google Vision API.
flash
Golang Keyword extraction/replacement Datastructure using Tries instead of regexes
cyac
High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python
sypht-java-client
A Java client for the Sypht API
ScrapeMate
Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.
format_parser
file metadata parsing, done cheap
newspaper3_usage_overview
This repository provides usage examples for the Python module Newspaper3k.
PlotDigitizer
A Python utility to digitize plots.