wmd
Parallel processing
Wow, great paper! Thank you for making the code OSS.
The documentation says that the Python wrapper is not suitable for parallel execution:
The wrapper is not suited for concurrent execution. It uses a global variable for the distance callback function, so calling
emd from concurrent threads will result in undefined behavior.
However, the function get_wmd calls emd concurrently. Can you please explain?
Ah this looks like an oversight indeed! If you need parallel processing, maybe gensim's WMD code supports this? https://radimrehurek.com/gensim/models/word2vec.html#gensim.models.word2vec.Word2Vec.wmdistance
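If you would rather stay with this wrapper, one possible workaround is to serialize the calls behind a lock so that only one thread touches the global callback at a time. The sketch below is only an illustration: `unsafe_emd` is a hypothetical stand-in that mimics the "global callback" pattern described in the documentation (it does not compute a real EMD and is not the wrapper's actual function), and `locked_emd` and `run_pairs` are made-up helper names.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a wrapper that keeps its distance callback in a
# module-level global. It is NOT the real emd and does not compute EMD; it
# only reproduces the shared-state pattern that makes concurrent calls unsafe.
_callback = None


def unsafe_emd(xs, ys, distance):
    global _callback
    _callback = distance  # shared state: another thread could overwrite this
    return sum(_callback(x, y) for x, y in zip(xs, ys))


# A single lock guards every call, so the global is never shared mid-call.
_emd_lock = threading.Lock()


def locked_emd(xs, ys, distance):
    with _emd_lock:
        return unsafe_emd(xs, ys, distance)


def run_pairs(pairs, distance, workers=4):
    # Threads may still overlap on surrounding work, but the guarded call
    # itself runs one at a time.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda p: locked_emd(p[0], p[1], distance), pairs))


if __name__ == "__main__":
    pairs = [([1, 2], [3, 5]), ([0], [4])]
    print(run_pairs(pairs, lambda a, b: abs(a - b)))
```

Note that the lock makes the calls safe but not faster, since the guarded section runs serially. For actual speedup, running the work in separate processes is probably the better route, on the assumption that each process gets its own copy of the global.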