David S. Batista
David S. Batista
text-classification
An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines
NER-Evaluation
An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity
BREDS
"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in EMNLP'15 - Python implementation
Snowball
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Annotated-Semantic-Relationships-Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Aspect-Based-Sentiment-Analysis
Aspect-Based Sentiment Analysis Experiments
NER-datasets
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
awesome-Portuguese-NLP
A list of libraries and NLP projects for Portuguese
ConvNets-for-Sentence-Classification
"Convolutional Neural Networks for Sentence Classification" (Kim 2014) - https://www.aclweb.org/anthology/D14-1181
information-extraction-PT
An example of triples extraction with PoS-tags using ReVerb