nlp icon indicating copy to clipboard operation
nlp copied to clipboard

Online/streaming LDA?

Open carbocation opened this issue 7 years ago • 1 comments

Is it possible to run LDA (or other processing algorithms) in a streaming/online fashion, such as is done with gensim? It seems that this would not easily support online processing, but I thought I'd bounce the question off of you since you know the internals much better.

carbocation avatar Aug 25 '18 22:08 carbocation

Great question. All the algorithms will work in an online setting but the majority require batch training in advance. Some, like the LDA and RI algorithms could be made to work with online training with a small amount of effort. The HashingVectoriser doesn't require training so is particular suited to streaming data. I will take a look and see if I can add the online training and persistence support. In the meantime, Pull Requests are welcome :-)

james-bowman avatar Aug 26 '18 08:08 james-bowman