tomotopy
tomotopy copied to clipboard
Python package of Tomoto, the Topic Modeling Tool
I am migrating away from `model.make_doc` to `tp.util.Corpus` and am finding that using Corpus segfaults. My tiny repro is here: ```python #!/usr/bin/env python3 import time import numpy as np import...
I have a 6.5GB model trained on 10M docs to model 100 topics trained the usual way. I'm trying to load the model and I'm finding that load times are...
I have a large corpus (30M docs) and a pretrained inference-only tomotopy model. I want to find the argmax topic for each doc in the corpus and have found through...
안녕하십니까 tomotopy의 dtm모델을 돌리다가 문의사항이 있어 글을 올립니다. 모델을 훈련시킨 후, 훈련에 사용된 문서들이 어느 토픽에 분류가 되었는지 확인하고자 DTModel.docs[i].get_topics를 활용하여 각 문서가 어떤 토픽으로 분류가 되었는지 확인하였습니다. 그 이후,...
I am training a simple LDA model. The _infer_ method allows the predictions of topic distributions of **unseen** out of the model documents. What should I do to get the...
Hi, big fan of the package, thanks for putting it together - I've had fantastic experiences with LDAModel and HDPModel, but am running into numerical stability issues as I'm trying...
Hello, I'm trying several models with different coherence measures, but I have some questions I need to understand. 1. Is the value of the `SLIDING_WINDOWS` fixed? or can I change...
Hi, I encoutered the below errors when installing tomotopy. Please help me fix it. Thank you very much: ``` In file included from src/python/py_rt.cpp:25: /Library/Developer/CommandLineTools/usr/lib/clang/13.1.6/include/cpuid.h:14:2: error: this header is for...
Hello, I'm very interested in your package and enjoying it a lot so far. I wonder if there is a way to show the term weights of the topic vocabularies...
Hello, I'm new to topic modeling and I'm trying the CTM model example. After loading my own data, which includes about 45000 documents, I get these messages during the process:...