molenc
add L_1 norm
Also known as the "Manhattan distance"; it is supposed to behave well in high dimensions.
cf.
Aggarwal, C. C., Hinneburg, A., & Keim, D. A. (2001, January). On the surprising behavior of distance metrics in high dimensional space. In International conference on database theory (pp. 420-434). Springer, Berlin, Heidelberg.
Also, implement a fractional distance metric, where the exponent f in ]0, 1] could be optimized depending on the application.
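For reference, here is a minimal Python sketch of what is being requested, following the definition in Aggarwal et al.: dist_f(x, y) = (sum_i |x_i - y_i|^f)^(1/f). The function names `fractional_distance` and `l1_distance` are hypothetical and are not part of molenc; fingerprints are assumed to be dense vectors of equal length.

```python
from typing import Sequence


def fractional_distance(x: Sequence[float], y: Sequence[float], f: float = 1.0) -> float:
    """Fractional distance (sum_i |x_i - y_i|^f)^(1/f), with f in ]0, 1].

    f = 1 gives the L_1 (Manhattan) distance.
    Hypothetical helper for illustration, not molenc's API.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    if not 0.0 < f <= 1.0:
        raise ValueError("f must be in ]0, 1]")
    # Sum the per-dimension absolute differences raised to the power f
    total = sum(abs(a - b) ** f for a, b in zip(x, y))
    return total ** (1.0 / f)


def l1_distance(x: Sequence[float], y: Sequence[float]) -> float:
    """L_1 norm of (x - y), i.e. the fractional distance with f = 1."""
    return fractional_distance(x, y, 1.0)
```

Since f is a plain parameter here, it could be tuned per application, for example by cross-validation.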
We use Tanimoto when the fingerprints (FPs) are high-dimensional.
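For comparison, a minimal sketch of the Tanimoto coefficient, under the assumption that each fingerprint is represented as the set of its "on" feature indices. The function name `tanimoto` and the convention for two empty fingerprints are illustrative assumptions, not taken from molenc.

```python
from typing import Set


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto (Jaccard) similarity |a & b| / |a | b| of two feature sets.

    Hypothetical helper for illustration, not molenc's API.
    """
    union = len(a | b)
    if union == 0:
        # Assumed convention: two empty fingerprints are considered identical
        return 1.0
    return len(a & b) / union
```

The corresponding distance is `1.0 - tanimoto(a, b)`.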