
Top2Vec as an updating model

Open Uzay-G opened this issue 3 years ago • 1 comments

Hello,

I was wondering how feasible it would be to adapt Top2Vec, or other topic modeling methods, to work online, so that the model can be updated with new data as it arrives.

Would you have any advice on how one could do this in Top2Vec? Would it be feasible?

I am seeding an initial Top2Vec model and then receive a stream of new data. I am wondering how best to handle this so that I don't have to retrain the model each time.

Uzay-G, May 03 '22 09:05

Depending on the use case and the variability of the corpus, you can just use the add_documents method to add new documents to an existing model. If the corpus is constantly changing, you will need to re-train the model occasionally. There will hopefully be a "re-compute topics" method coming soon.
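
One way to combine the two suggestions above (incremental add_documents calls plus an occasional full re-train) is sketched below. This is only an illustration, not part of Top2Vec: the StreamingUpdater class, its parameters, and the thresholds are hypothetical, and the only model method it relies on is add_documents. The retrain argument is an assumed user-supplied function that builds a fresh model from the full corpus.

```python
from typing import Any, Callable, List


class StreamingUpdater:
    """Hypothetical helper (not part of Top2Vec).

    Buffers incoming documents, adds them to the model in batches via
    add_documents, and rebuilds the model from the full corpus once enough
    new documents have accumulated since the last (re)training.
    """

    def __init__(
        self,
        model: Any,                            # any object with add_documents(list_of_str)
        corpus: List[str],                     # documents the model was trained on
        retrain: Callable[[List[str]], Any],   # assumed: builds a new model from a corpus
        batch_size: int = 100,                 # illustrative default
        retrain_after: int = 10_000,           # illustrative default
    ) -> None:
        self.model = model
        self.corpus = list(corpus)
        self.retrain = retrain
        self.batch_size = batch_size
        self.retrain_after = retrain_after
        self.pending: List[str] = []
        self.added_since_retrain = 0

    def push(self, document: str) -> None:
        """Queue one document; flush once a full batch is waiting."""
        self.pending.append(document)
        if len(self.pending) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        """Send queued documents to the model, or retrain if the threshold is hit."""
        if not self.pending:
            return
        batch, self.pending = self.pending, []
        self.corpus.extend(batch)
        self.added_since_retrain += len(batch)
        if self.added_since_retrain >= self.retrain_after:
            # Full re-train on everything seen so far (the batch is already in corpus).
            self.model = self.retrain(self.corpus)
            self.added_since_retrain = 0
        else:
            # Cheap incremental update of the existing model.
            self.model.add_documents(batch)
```

With Top2Vec, retrain could be something like `lambda docs: Top2Vec(documents=docs)`, with the initial model built the same way. A fixed document count is the simplest trigger; how often a re-train is actually needed depends on how quickly the corpus drifts.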

ddangelov, May 04 '22 17:05

Is there a "re-compute topics" method coming?

AIRobotZhang, Nov 17 '22 09:11