Kavita Ganesan

7 repositories owned by Kavita Ganesan

nlp-in-practice

1.1k stars, 782 forks

Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre...

opinosis-summarization

51 stars, 18 forks

This repo contains the code and dataset for the Opinosis Summarization Framework.

ROUGE-2.0

206 stars, 37 forks

ROUGE automatic summarization evaluation toolkit. Supports ROUGE-[N, L, S, SU], stemming and stopwords in different languages, Unicode text evaluation, and CSV output.

phrase-at-scale

125 stars, 45 forks

Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used with languages other than English.

clinical-concepts

25 stars, 11 forks

Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes...

OpinRank

41 stars, 11 forks

OpinRank Dataset. Contains user reviews of hotels and cars: full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).

word_cloud

48 stars, 14 forks

Python word cloud library for use within Jupyter notebooks and Python apps.