Errors in Word Embedding Vector file.

Open Nazzish opened this issue 7 years ago • 2 comments

I am facing issues with the word embedding vector file. Could you check whether it contains errors after line 500000? I am also unable to download a fresh copy from the link given in README.md, http://kinloch.inf.ed.ac.uk/public/Refresh-NAACL18-1-billion-benchmark-wordembeddings.tar.gz, which returns a 403 Forbidden error.

Nazzish avatar Oct 26 '18 05:10 Nazzish

I'm getting the following error repeatedly.

Traceback (most recent call last):
  File "document_summarizer_training_testing.py", line 282, in <module>
    tf.app.run()
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/platform/app.py", line 30, in run
    sys.exit(main(sys.argv))
  File "document_summarizer_training_testing.py", line 277, in main
    train()
  File "document_summarizer_training_testing.py", line 89, in train
    vocab_dict, word_embedding_array = DataProcessor().prepare_vocab_embeddingdict()
  File "/home/cvml/Documents/Enforcement/ReRankZip/Refresh-master/data_utils.py", line 339, in prepare_vocab_embeddingdict
    embeddata = [float(item) for item in linedata[1:]][0:FLAGS.wordembed_size]
ValueError: invalid literal for float(): .0950.099540928

Please advise.

Nazzish avatar Oct 26 '18 06:10 Nazzish

Our server was down for some reason, which is why you were not able to download those files. I have raised the issue, and it should be up and running again in a few minutes.

Is your second problem related, or did I misunderstand?

shashiongithub avatar Oct 26 '18 10:10 shashiongithub
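
A note on the traceback above: the token `.0950.099540928` looks like two numbers run together, which would be consistent with a truncated or corrupted copy of the embedding file rather than a bug in `data_utils.py`. The sketch below is one way to locate such lines before retraining. It assumes the file is plain text with one entry per line, a word followed by whitespace-separated floats, which is what the `linedata[1:]` parsing in the traceback suggests but is not confirmed here. The function name `find_bad_embedding_lines` and its parameters are hypothetical, and the code is written for Python 3 rather than the Python 2.7 used in the traceback.

```python
import io


def find_bad_embedding_lines(lines, expected_dim=None, max_report=10):
    """Return a list of (line_number, reason) for lines that fail to parse.

    `lines` is any iterable of text lines, for example an open file.
    `expected_dim` is optional; when given, lines whose vector has a
    different number of values are reported as well.
    """
    problems = []
    for lineno, line in enumerate(lines, start=1):
        parts = line.split()
        if not parts:
            continue  # ignore blank lines
        values = parts[1:]  # assumption: first token is the word itself

        bad_token = None
        for item in values:
            try:
                float(item)
            except ValueError:
                bad_token = item
                break

        if bad_token is not None:
            problems.append((lineno, "non-numeric value %r" % bad_token))
        elif expected_dim is not None and len(values) != expected_dim:
            problems.append(
                (lineno, "expected %d values, found %d" % (expected_dim, len(values)))
            )

        if len(problems) >= max_report:
            break  # stop early so a badly damaged file does not flood the output
    return problems


# Small in-memory sample that reproduces the fused token from the traceback.
sample = io.StringIO(
    "the 0.1 0.2 0.3\n"
    "cat 0.4 .0950.099540928 0.5\n"
    "dog 0.7 0.8\n"
)
problems = find_bad_embedding_lines(sample, expected_dim=3)
for lineno, reason in problems:
    print("line %d: %s" % (lineno, reason))
```

To run it against the real file, pass an open file handle instead of the sample, for example `with open(path) as f: find_bad_embedding_lines(f)`, where `path` is wherever the extracted embeddings live. If the file starts with a header line (some word2vec-style text files do), that line may be reported when `expected_dim` is set and can be ignored. If problems cluster after a certain line, re-downloading the archive once the server is back is probably the simplest fix.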