to calculate semantic similarity between two words such as happy and sad

Open xinli2008 opened this issue 2 years ago • 1 comments

Hello, can this tool be used to calculate the semantic similarity between two words such as "happy" and "sad"? @MaartenGr

xinli2008 avatar Mar 29 '22 09:03 xinli2008

Yes! PolyFuzz is meant to find the distance between two sets of strings. Distance might mean string distance, as in how many changes you need to make to go from one string to another, or it might mean distance in terms of semantic similarity.

Semantic similarity is typically computed with embedding techniques such as Word2Vec, FastText, or transformer models, which have shown tremendous performance boosts. To use one of these techniques, you can follow along with the guide here.
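
To make the two notions of distance concrete, here is a minimal sketch in plain Python. It does not use PolyFuzz's API. The function names `edit_distance` and `cosine_similarity` are hypothetical, and the three-dimensional vectors in `toy_embeddings` are invented for illustration, so the resulting numbers say nothing about what a real embedding model would return for these words.

```python
import math


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, or substitutions to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cosine_similarity(u, v) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


# ASSUMPTION: made-up vectors standing in for real word embeddings.
toy_embeddings = {
    "happy": [0.9, 0.1, 0.3],
    "glad": [0.8, 0.2, 0.3],
    "sad": [-0.7, 0.2, 0.4],
}

# String distance looks only at characters.
print(edit_distance("happy", "sad"))

# Semantic similarity looks only at the vectors.
print(cosine_similarity(toy_embeddings["happy"], toy_embeddings["glad"]))
print(cosine_similarity(toy_embeddings["happy"], toy_embeddings["sad"]))
```

In practice the vectors would come from a trained model such as Word2Vec, FastText, or a transformer, and the comparison step would stay the same.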

MaartenGr avatar Mar 29 '22 16:03 MaartenGr