python convert-wordemb-dict2emb-matrix.py ./Data/training.dict.pkl ./wordEmb/GoogleNews-vectors-negative300.bin Word2Vec_WordEm
The following non-word tokens will not be extracted from the pretrained embeddings: ['', '', '', '', '', '.', ',', '``', "''", '[', ']', '`', '-', '--', "'", '', '<first_speaker>', '<second_speaker>', '<third_speaker>', '<minor_speaker>', '<voice_over>', '<off_screen>', '']
INFO:covert-wordemb-dict2emb-matrix:Word2Vec embeddings given as input
INFO:covert-wordemb-dict2emb-matrix:No spelling corrections will be applied
INFO:covert-wordemb-dict2emb-matrix:Final word embedding dim: 300
INFO:covert-wordemb-dict2emb-matrix:Final standard deviation: 0.010000
INFO:covert-wordemb-dict2emb-matrix:Vocabulary size: 24
Using TensorFlow backend.
INFO:summa.preprocessing.cleaner:'pattern' package not found; tag filters are not available for English
Traceback (most recent call last):
File "convert-wordemb-dict2emb-matrix.py", line 129, in
embedding_dict = gensim.models.Word2Vec.load_word2vec_format(args.embedding_dictionary, binary=True)
File "/usr/local/lib/python2.7/dist-packages/gensim/models/word2vec.py", line 1420, in load_word2vec_format
raise DeprecationWarning("Deprecated. Use gensim.models.KeyedVectors.load_word2vec_format instead.")
DeprecationWarning: Deprecated. Use gensim.models.KeyedVectors.load_word2vec_format instead.