Results 105 comments of Minchul Lee

Hi @jlevy44 Sorry for late reply. Could you share more detail about your environment including OS, Python version and tomotopy version?

Hi @jlevy44 `bad_alloc` exception is thrown usually when the memory allocation is failed. But I think system memory is enough to train DTM in most of use case. So we...

@MarkWClements You can provide `uid` as optional argument to `Corpus.add_doc` like followings: ```python corpus = tp.utils.Corpus() corpus.add_doc(some_words, uid="doc1") corpus.add_doc(some_words, uid="doc2") corpus.add_doc(some_words, uid="doc3") ``` I'll supplement the documentation about this.

Hi @MarkWClements 1. Currently, there is no feature about modifying `uid`. I'll add it to future development features. 2. Usually, `trained_docs[n]` is the same document as `docs[n]`, except a few...

Hello @MarkWClements The current version of tomotopy uses only a sample from the latest model state. Although this often leads to inaccurate estimation, this method was adopted because of its...

Hi @Frank-Gao-UCB , you can access the documents in the model and get topic distribution of them as follows: ```python mdl = LDAModel(...) # ... some works for adding documents...

Hi @benreaves You can run python with environment variable `export TOMOTOPY_ISA=none` if you want to turn off SIMD optimization. The environment variable `TOMOTOPY_ISA=none` disables dynamic binary loading of tomotopy, forces...

Hi @crawlserv , Actually, Maximal Marginal Relevance is not implemented yet. So the lambda parameter in c++ code is currently just a dummy. When the implementation is complete, I plan...

Hello @MVP1996 , Thank you for sharing your details. First, to briefly answer each question: * It seems like a good idea to get the initial values of mu and...

안녕하세요 @tkddnd0214 `docs[i].get_topics`는 i번째 doc에 포함된 상위 n개(주로 10개) 토픽 분포를 보여줍니다. `DTModel.get_topic_words`는 지정한 토픽의 상위 n개(주로 10개)의 단어 분포를 보여주고요. 애초에 전자는 상위 토픽을 보여주고 후자는 상위 단어를 보여주는...