Minchul Lee comments

Results 123 comments of


                                            Minchul Lee

Question: Showing vocabularies weights

Hi @juneMJ Thank you for your suggestion. As you said, current version of tomotopy doesn't provide API for accessing weights value of each term of vocabularies or for adding custom...

Providing a conda package

Thank you for good suggestion, @ZechyW After looking at conda-forge, I'll add distributions for conda.

"RuntimeError: Either `words` or `rawWords` must be filled" using `add_doc` sometimes

Hi @batmanscode , It seems that there is an empty document in your `df['clean_text']`. Could you check the value of `df['clean_text']` to make sure there are no blank documents?

"RuntimeError: Either `words` or `rawWords` must be filled" using `add_doc` sometimes

@batmanscode `df.isnull()` tests only if the value is `NA` or not. Because an empty str `''` is not `NA`, it doesn't show any empty strings. Try following: ```python df['clean_text'].apply(lambda x:bool(x.strip())).value_counts()...

"RuntimeError: Either `words` or `rawWords` must be filled" using `add_doc` sometimes

@batmanscode Currently, `add_doc` has no such feature. But I think it's a good idea to add the option to ignore empty docs.

Explaining messages in CTM model example

Hello @juneMJ, thank you for your interest to my package. The first and second warning are related to numerical stability of tomotopy's CTM implementation. Currently, it seems to be a...

Support for Apple Silicon M1

Hi @stephangreene , Currently, tomotopy don't support Apple Silicon M1. I am working on M1 support, but this may takes some time. Please use Rosetta until M1 is fully supported.

Question about topic assignments with LDA models

@alonjacovi Sorry for the late reply. Oh, I accidentally forgot to answer it. You can get topic assignment of each word in a trained or an inferred document. (https://bab2min.github.io/tomotopy/v0.11.0/en/#tomotopy.Document.topics) Get...

Loading a fairly large model takes a long time

Hi @erip Yes, the model saved with `full=True` argument has all parameters related to whole train, thus it may take a long time to re-load it. If you save the...

Loading a fairly large model takes a long time

@erip Actually, the package doesn't use numpy binarization for loading & saving, but uses a custom serialization functions. And it is true that there are many features related to backward...