word-embeddings-benchmarks issues

urllib.error.URLError: <urlopen error [WinError 10060]

....... data = url_opener.open(request)
File "C:\ProgramData\Anaconda3\lib\urllib\request.py", line 525, in open
  response = self._open(req, data)
File "C:\ProgramData\Anaconda3\lib\urllib\request.py", line 542, in _open
  result = self._call_chain(self.handle_open, protocol, protocol +
File "C:\ProgramData\Anaconda3\lib\urllib\request.py", line 502,...

foxghoul
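
WinError 10060 is the Windows socket error for a connection attempt that timed out, so the failure in this traceback happens at the connection level. A minimal sketch of one possible workaround, assuming the download goes through a urllib opener as the traceback suggests: pass an explicit timeout and retry a few times. The helper name `open_with_retries` and all of its parameters are hypothetical and not part of the benchmark package.

```python
import time
import urllib.error
import urllib.request


def open_with_retries(url, opener=None, retries=3, timeout=30.0, delay=1.0):
    """Open `url`, retrying on URLError (hypothetical helper, not the library's API)."""
    if retries < 1:
        raise ValueError("retries must be at least 1")
    if opener is None:
        # Default urllib opener; a caller could pass one configured with a proxy.
        opener = urllib.request.build_opener()
    last_error = None
    for attempt in range(retries):
        try:
            return opener.open(url, timeout=timeout)
        except urllib.error.URLError as err:
            last_error = err
            # Wait before the next attempt, but not after the final one.
            if attempt < retries - 1:
                time.sleep(delay)
    raise last_error
```

If every attempt fails, the last URLError is re-raised, so the caller still sees the original error. A persistent timeout usually means the host is unreachable from that network, which retries alone will not fix.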

Could you provide some domain-independent embedding validation methods?

I took a glance at this project. Its main idea is to validate the embedding results against labeled data (most are fixed, provided by human setting word...

svjack

Can you retrieve counter_of_words from an OrderedVocabulary object?

1

In the function transform_words of the Embedding object in embedding.py, I cannot understand the following code: elif isinstance(self.vocabulary, OrderedVocabulary): # range in python3 is lazy counter_of_words = range(len(self.vocabulary.words) - 1, -1,...

svjack
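
The quoted line is cut off, so what follows is only one reading of the idea and not the code in embedding.py. Assuming an ordered vocabulary stores words from most to least frequent without the actual corpus frequencies, a descending sequence of integers can stand in for counts. The helper `rank_based_counts` is hypothetical.

```python
def rank_based_counts(ordered_words):
    """Map each word to a pseudo-count derived only from its position."""
    n = len(ordered_words)
    # The first word gets n - 1 and the last gets 0.
    # These are ranks used as stand-ins, not real corpus frequencies.
    return {word: n - 1 - index for index, word in enumerate(ordered_words)}


pseudo_counts = rank_based_counts(["the", "of", "cat"])
```

Under this reading the real counts cannot be recovered from the object; only the relative order of the words is preserved.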

Many datasets for fetching are unavailable.

1

Many datasets fetched by the fetch_* functions in similarity.py are not available, i.e. they cannot be downloaded from Dropbox. Could you please provide them on Google Drive or another cloud location? Thank you.

svjack

Reason for scaling similarity scores

Hi! Thank you for creating this repo. Could you please tell us why you decided to multiply the MTurk similarity scores by 2: https://github.com/kudkudak/word-embeddings-benchmarks/blob/8fd04891a92d313cc3b6956a43f25c9e44022e0e/web/datasets/similarity.py#L37 and those of the RG65 dataset...

jabalazs
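
This does not explain the authors' choice, but one point is worth noting: if similarity is scored with Spearman rank correlation, as is common for these datasets, multiplying the human scores by a positive constant leaves the result unchanged, because only the ranks matter. A small pure-Python sketch with made-up numbers (none of the values come from MTurk or RG65):

```python
def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def spearman(x, y):
    """Spearman correlation computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


human = [1.0, 2.5, 2.5, 4.0, 5.0]        # made-up ratings on a 1-5 scale
model = [0.10, 0.40, 0.35, 0.80, 0.70]   # made-up cosine similarities
scaled = [2 * score for score in human]  # same ratings multiplied by 2

original_rho = spearman(human, model)
scaled_rho = spearman(scaled, model)     # identical to original_rho
```

So under a rank-based metric the scaling would matter only for things like putting several datasets on a common range, not for the reported correlation.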

Different evaluation results for the same embeddings.

Hi there, this is a very useful tool that helps us evaluate word embeddings in a consistent way. However, during the evaluation process, I found a bug. I have...

flyaway1217

SG Google News embeddings (word2vec) are reaching 0.726 on Google word analogy corpus

2

Thanks a lot for providing this code, this helps for a fair comparison of frameworks. According to my experiments with your tool, SG Google News embeddings (word2vec) are reaching a...

hanselowski

Add a very small "test" embedding

Analogy solver should normalize embeddings

Add flake8 to travis
