wang2vec
Extension of the original word2vec using different architectures
Hi there, is it possible to use wang2vec for training on a very large file (>70GB)? In gensim, this is made possible by the iterator [LineSentence](https://radimrehurek.com/gensim/models/word2vec.html#gensim.models.word2vec.Word2Vec), which > for larger...
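For readers unfamiliar with the streaming idea referred to in this question, here is a minimal Python sketch of a re-iterable sentence stream that reads one line at a time instead of loading the whole corpus into memory. It only illustrates the general approach behind gensim's `LineSentence`; the class name `StreamingSentences` and the helper `demo` are hypothetical, and nothing here implies that wang2vec itself accepts such an iterator.

```python
import os
import tempfile


class StreamingSentences:
    """Hypothetical helper: re-iterable stream of tokenized lines from a text file."""

    def __init__(self, path):
        self.path = path

    def __iter__(self):
        # Reopen the file on every pass so the corpus can be iterated
        # several times (for example, once per training epoch).
        with open(self.path, encoding="utf-8") as handle:
            for line in handle:
                tokens = line.split()  # whitespace tokenization, one sentence per line
                if tokens:  # skip blank lines
                    yield tokens


def demo():
    # Write a tiny stand-in corpus; a real corpus would be far larger.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".txt", delete=False, encoding="utf-8"
    ) as tmp:
        tmp.write("the quick brown fox\n\njumps over the lazy dog\n")
        path = tmp.name
    try:
        sentences = StreamingSentences(path)
        first_pass = list(sentences)
        second_pass = list(sentences)  # iterating again restarts from the top
        return first_pass, second_pass
    finally:
        os.remove(path)
```

Because only one line is held in memory at a time, memory use depends on the longest line rather than on the size of the file.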
If you run ./kmeans_txt, the instructions say to give input file and num classes, but this causes a segmentation fault. It also requires an output file after the input_file. The...
I want to train my word embeddings on more than one training file. Which command should I use to train the model with multiple input files?
Hi, I'd like to know if you have a pre-trained English model available for download. Thanks in advance.
I get a segmentation fault with high dimensions (600 or more) using cbow. A normal word2vec runs fine for this size but wang2vec does not. I am able to run...
I'm trying to train a model using part-of-speech tags as word classes. When I supply even a very small file of size ~1000 lines with word classes, word2vec causes Segmentation...
Hi Wang. I'd like to know if the way I use the original doc2vec for inferring sentence/document vectors is the same way I would use your structural extension. Thank you very much...
Hi Wang, Thanks for sharing the code. I have the following two questions. What can weightedword2vec do? Is there an implementation of an attention-based cbow model?
Hello Mr. Ling, Thank you for this work. I'm having the following issue and was wondering if you might know the reason. I'm running the following command: ./word2vec -train Corpus.txt...