simhash
simhash copied to clipboard
About Chinese
Does not support the Chinese ?
English speaking people are not very concerned by Unicode :) In the code sample provided, you see that he uses an ASCII string :
[]byte("this is a test phrase"),
However, the lib can support Unicode using go.text
simhash.Simhash(simhash.NewUnicodeWordFeatureSet(content, norm.NFKC))}
See this repo for a complete example : https://github.com/bbalet/gorelated