Results 123 comments of Minchul Lee

Thank you for reporting a potential bug. I'll examine your code and data and figure out why PYTHONHASHSEED affects the results.

Hi @gonzalezf, nice question! It's actually simple. When initializing the LDA model, each word is assigned to a random subject with an uniform distribution. `set_word_prior` manipulates this distribution to be...

Hi @Raymond-KT As far as I know there are no Python tools to visualize hierarchical topic models. So I recommend to extract the hierarchy of topics and its words from...

Hi @hhagedorn, Sorry for the confusion due to the unclear documentation. For `targets`, the latter is correct. In other words, you just pass a set of words in to-be evaluated...

Hi @benreaves There appears to be some bugs in the current implementation of `tomotopy.coherence`. However, a similar situation was not reproduced in my test set, so it is difficult to...

@benreaves Thank you for sharing the files and details. I'll look into them!

안녕하세요 @Kwon-subin DTM 모델은 시간에 따라 토픽이 변하는 모델입니다. 따라서 토픽의 분포는 시간의 영향을 받으므로, 특정 토픽의 단어 분포를 구하려면 그 시간도 함께 명시해주어야 합니다. https://github.com/bab2min/tomotopy/blob/d30964ce0610a5e34d3645cfc8c26d99536cac03/tomotopy/coherence.py#L148-L158 그런데 제보해주신것처럼 현재 Coherence...

There are three possible causes. 1. Random Number Engine https://github.com/bab2min/tomotopy/blob/400f06006f928db39ce2832830271ad0c87981b3/src/TopicModel/TopicModel.hpp#L13-L28 2. Prefix Sum https://github.com/bab2min/tomotopy/blob/400f06006f928db39ce2832830271ad0c87981b3/src/Utils/sample.hpp#L123-L140 3. Eigen's redux function(sum()) The summation order can be varied by SIMD options. (1) and (2)...

Hi, @mayankchatteron1 Since version 0.8.2, you can obtain the same result in the same machine when the seed is fixed with only `workers=1`. If the result is different even though...

@mayankchatteron1 Yes. In a multithreaded environment, it is impossible to completely control the execution order by software. In order to control it, each inference step needs to be synchronized, which...