text2vec icon indicating copy to clipboard operation
text2vec copied to clipboard

SWEM-concat Implementation in text2vec

Open raywyf opened this issue 4 years ago • 0 comments

Hello!

I'm currently using text2vec to create embeddings for a dataset of tweets. Since each document is pretty short, I want to implement a Simple Word Embedding Model (SWEM), specifically SWEM-concat, whereby I concatenate together the average of all word vectors in a document, and the result from max-pooling. This method is discussed in this paper.

I can get the document averages by normalizing the dtm object and then taking the dot product of that with the word vector object, but I'm struggling with how to get the max-pooling results. Any help would be much appreciated!

Thanks!

raywyf avatar May 11 '21 13:05 raywyf