nlp-topic-models icon indicating copy to clipboard operation
nlp-topic-models copied to clipboard

Application of topic models for topic extraction and similarity search

Natural Language Processing (NLP) using Topic Modeling

Application of topic model with special focus on German texts.

Datasets:

Algorithms:

  • TODO LSI - Latent Semantic Indexing (SVD)
  • LDA - Latent Dirichlet Allocation
  • TODO NMF - Non-negative Matrix Factorization

Tools:

Useful and inspirational resources

Topic Modeling Tutorials

About: Building, Evaluating, Visualizing Topic Models

Topic Models applied on Wikipedia

  • https://radimrehurek.com/gensim/wiki.html
  • https://www.kdnuggets.com/2017/11/building-wikipedia-text-corpus-nlp.html

Other NLP

  • https://github.com/adbar/German-NLP

Research

Data Sources

Bibliography

LDA

  • David M. Blei, Andrew Y. Ng, Michael I. Jordan. Latent Dirichlet Allocation. In: Journal of Machine Learning Research, 2003

Sentiment

  • R. Remus, U. Quasthoff & G. Heyer: SentiWS - a Publicly Available German-language Resource for Sentiment Analysis. In: Proceedings of the 7th International Language Ressources and Evaluation (LREC'10), pp. 1168-1171, 2010