corpus-processing topic
Wordless
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
bitextor
Bitextor generates translation memories from multilingual websites
TreebankPreprocessing
Python scripts for preprocessing the Penn Treebank and Chinese Treebank
OpusFilter
OpusFilter - Parallel corpus processing toolkit
OPIEC
Reading the data from OPIEC - an Open Information Extraction corpus
corpuslingr
A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.
alvisnlp
ALvisNLP corpus processing engine
corpusexplorer2.0
Corpus linguistics has never been so easy...
StringAnalysis.jl
Hard fork of JuliaText/TextAnalysis.jl