saffsd

Results 7 repositories owned by saffsd

geniatagger

19
Stars
16
Forks
Watchers

- part-of-speech tagging, shallow parsing, and named entity recognition for biomedical text -

kaggle-stackoverflow2012

46
Stars
33
Forks
Watchers

My entry to the Kaggle 2012 Stack Overflow competition. Ranked 10th on the final public leaderboard.

kaggle-stumbleupon2013

15
Stars
6
Forks
Watchers

My entry to the Kaggle 2013 StumbleUpon competition. Ranked 4th on the final private leaderboard.

langid.c

21
Stars
10
Forks
Watchers

Pure C natural language identifier with support for 97 languages

langid.py

2.0k
Stars
297
Forks
Watchers

Stand-alone language identification system

polyglot

29
Stars
5
Forks
Watchers

Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the languages therein.

wikidump

42
Stars
17
Forks
Watchers

Tools to manipulate and extract data from wikipedia dumps