computational-linguistics topic

List computational-linguistics repositories

datastories-semeval2017-task4

196
Stars
63
Forks
Watchers

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".

awesome-hungarian-nlp

210
Stars
18
Forks
Watchers

A curated list of NLP resources for Hungarian

openWordnet-PT

152
Stars
35
Forks
Watchers

OpenWordnet-PT: an open access wordnet for Portuguese

elpis

152
Stars
32
Forks
Watchers

πŸ™Š software for creating speech recognition models.

compling_nlp_hse_course

170
Stars
74
Forks
Watchers

ΠœΠ°Ρ‚Π΅Ρ€ΠΈΠ°Π»Ρ‹ курса ΠΏΠΎ ΠΊΠΎΠΌΠΏΡŒΡŽΡ‚Π΅Ρ€Π½ΠΎΠΉ лингвистикС Π¨ΠΊΠΎΠ»Ρ‹ Лингвистики НИУ Π’Π¨Π­

colibri-core

122
Stars
20
Forks
Watchers

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...

ucto

63
Stars
13
Forks
Watchers

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to...

folia

59
Stars
10
Forks
Watchers

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of li...

LaMachine

67
Stars
20
Forks
Watchers

LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilation/installation script

flat

108
Stars
15
Forks
Watchers

FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotat...