gateplugin-LearningFramework
gateplugin-LearningFramework copied to clipboard
Add support for topic models by wrapping gensim
trafficstars
This will need an even simpler "corpus representation" for text (list of tokens) only.
A quicker way and easier to use may be to use Mallet's LDA module. This requires a different Mallet Corpus Representation (based on FeatureSequences and no targets) and may not scale as well as using an OOM representation with a GenSim backend, but would not need any additional software installed and automatically port to Windows.