spark-LDA-example
spark-LDA-example copied to clipboard

Published 20 hours ago •

→

Metadata

A simple Spark LDA example. to demonstrate a full fletched clustering algorithm, with data cleaning using the processess like lemmatization , stemming etc.

Readme
Issues

spark-LDA-example

A simple Spark LDA example. This project contains a basic Document Clustering example in which data cleaning is also done.

We are going to perform these procedures for the document clustering, these steps include:

Spark RegexTokenizer : For Tokenization
Stanford NLP Morphology : For Stemming and lemmatization
Spark StopWordsRemover : For removing stop words and punctuation
Spark TF-IDF : For computing term frequencies or tf-idf
Spark LDA : For Clustering of documents.

About

A simple Spark LDA example. to demonstrate a full fletched clustering algorithm, with data cleaning using the processess like lemmatization , stemming etc.

Stars

Forks

Watchers

Owner

shiv4nsh

← Metadata

Stars

Forks

Watchers

Owner

shiv4nsh

Metadata

A simple Spark LDA example. to demonstrate a full fletched clustering algorithm, with data cleaning using the processess like lemmatization , stemming etc.

Back

spark-LDA-example spark-LDA-example copied to clipboard

Metadata

spark-LDA-example

← Metadata

Owner

Metadata

spark-LDA-example
spark-LDA-example copied to clipboard