lda-c
lda-c copied to clipboard
What parameters were used to generate the example output?
I would like to reproduce the the results in https://github.com/Blei-Lab/lda-c/blob/master/example/ap-topics.pdf however I don't know what settings.txt alpha and seedings were used. Can you please help?
I also notice that some of the preprocessing steps of moving from ap.txt
to ap.dat
are missing. What stop words were removed? What tokenization processed was used. This stuff is important for proper reproducibility.
Have a look at this. Stackoverflow