Streaming Data Sources

Open tarikaltuncu opened this issue 4 years ago • 3 comments

Hi, is D-ETM suitable for streaming data sources? How would you suggest training in mini-batches as the data accumulates?

I also have a related question. I understand that DETM can adapt to concept drift by letting the mean and variance of topics shift over time according to a Gaussian distribution. What if a particular topic exists only (and heavily) in one or a few time intervals? Would DETM be able to detect it, or does it assume that all topics exist in all time intervals?

tarikaltuncu avatar Sep 26 '20 14:09 tarikaltuncu

No. In my experience, not every topic exists in every time interval.

Yes, there will surely be a topic that occurs in only one particular time interval.
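
As a rough illustration of why this can happen, here is a minimal sketch of the time-varying topic prior as I understand it from the DETM paper: the prior mean of the topic proportions follows a Gaussian random walk across time slices, and the proportions come from a softmax of it. This is not the repository's code; the function names, the parameter `delta`, and the default sizes are hypothetical and chosen only for the example.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def simulate_topic_prevalence(num_topics=5, num_times=10, delta=1.0, seed=0):
    """Simulate topic proportions at the prior mean for each time slice.

    Assumed generative sketch (hypothetical names, not the repo's API):
      eta_0 ~ N(0, I)
      eta_t ~ N(eta_{t-1}, delta^2 I)
    Returns one row of softmax(eta_t) per time slice.
    """
    rng = random.Random(seed)
    eta = [rng.gauss(0.0, 1.0) for _ in range(num_topics)]
    prevalence = [softmax(eta)]
    for _ in range(1, num_times):
        # Each topic's logit takes one Gaussian random-walk step.
        eta = [e + rng.gauss(0.0, delta) for e in eta]
        prevalence.append(softmax(eta))
    return prevalence


if __name__ == "__main__":
    # One row per time slice, one column per topic.
    for t, row in enumerate(simulate_topic_prevalence()):
        print(t, " ".join(f"{p:.3f}" for p in row))
```

Under this sketch, a softmax output is never exactly zero, so no topic is strictly absent from a time slice, but its share can become negligible in most slices and large in a few. Whether a trained model recovers such a short-lived topic would still depend on the data and the fitted variances.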

Emekaborisama avatar Oct 06 '20 13:10 Emekaborisama

When you say streaming data source, do you mean real-time data from Google Sheets or SQL?

Emekaborisama avatar Oct 06 '20 13:10 Emekaborisama

You might want to have a look at the paper on Dynamic Mixture Models by Wei et al. (2007) and similar papers on topic modeling for streaming data.

mona-timmermann avatar Oct 19 '20 14:10 mona-timmermann