Parallel processing

Open • josepablog opened this issue 9 years ago • 1 comment

Wow, great paper! Thank you for making the code OSS.

The documentation says that the Python wrapper is not suitable for parallel execution:

"The wrapper is not suited for concurrent execution. It uses a global variable for the distance callback function, so calling emd from concurrent threads will result in undefined behavior."

However, the function get_wmd calls emd concurrently. Can you please explain?

josepablog, Sep 30 '16 20:09

Ah, this looks like an oversight indeed! If you need parallel processing, maybe gensim's WMD code supports this? https://radimrehurek.com/gensim/models/word2vec.html#gensim.models.word2vec.Word2Vec.wmdistance

mkusner, Oct 21 '16 11:10
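
For readers who hit the same problem: one possible way to keep calling a wrapper like this from several threads is to serialize the calls behind a lock. The sketch below is not the repository's code. `toy_emd` is a hypothetical stand-in that only mimics a wrapper storing its distance callback in a module-level global (it does not compute a real earth mover's distance), and `locked_emd` and `run_jobs` are illustrative names. The lock gives up parallelism inside the call, but it prevents one thread from overwriting another thread's callback.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a wrapper that keeps its distance
# callback in a module-level global, as the quoted documentation describes.
_distance_callback = None


def toy_emd(a, b, distance):
    """Toy computation (NOT a real EMD) that reads the callback from a global."""
    global _distance_callback
    _distance_callback = distance
    # Another thread could overwrite _distance_callback at this point,
    # which is why unguarded concurrent calls are unsafe.
    return sum(_distance_callback(x, y) for x, y in zip(a, b))


# A single lock shared by all callers serializes access to the global.
_emd_lock = threading.Lock()


def locked_emd(a, b, distance):
    """Call toy_emd while holding the lock so the global is never shared."""
    with _emd_lock:
        return toy_emd(a, b, distance)


def abs_diff(x, y):
    return abs(x - y)


def sq_diff(x, y):
    return (x - y) ** 2


def run_jobs(jobs, workers=4):
    """Submit (a, b, distance) jobs to a thread pool; results keep job order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(locked_emd, a, b, d) for a, b, d in jobs]
        return [f.result() for f in futures]
```

With this approach the threads still overlap on any work done outside the locked call, for example preparing inputs. If the distance computation itself must run in parallel, separate processes (each with its own copy of the global) are the usual alternative.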