computational-linguistics topic
datastories-semeval2017-task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".
awesome-hungarian-nlp
A curated list of NLP resources for Hungarian
openWordnet-PT
OpenWordnet-PT: an open access wordnet for Portuguese
elpis
Software for creating speech recognition models.
compling_nlp_hse_course
Materials for the computational linguistics course at the School of Linguistics, NRU HSE
colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, either of fixed or dy...
ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to...
folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of li...
LaMachine
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script
flat
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotat...