metric-learn Allow support for multi-label algorithms

Open wdevazelhes opened this issue 6 years ago • 7 comments

Multi-label problems with many labels are a good use case for metric learning, so we could add support for them in the algorithms. For the supervised ones, this would mean slightly modifying the loss function (we have been discussing this with @bellet for NCA's PR in scikit-learn, for instance). For the weakly supervised ones, it would mean building tuples from multi-labeled data (there seem to be several strategies for doing so, for example based on how many labels two points share).
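To make the weakly supervised idea concrete, here is a minimal sketch of one possible strategy, assuming the labels come as a binary indicator matrix: two points form a similar pair if they share at least `min_shared` labels, and a dissimilar pair otherwise. The function name `make_pairs_from_multilabels` and the `min_shared` threshold are hypothetical and only for illustration; nothing like this exists in metric-learn yet.

```python
from itertools import combinations

import numpy as np


def make_pairs_from_multilabels(X, Y, min_shared=1):
    """Build pairs and pair labels from multi-labeled data (illustrative only).

    X: array of shape (n_samples, n_features)
    Y: binary indicator array of shape (n_samples, n_labels)
    Returns pairs of shape (n_pairs, 2, n_features) and y_pairs in {+1, -1}.
    """
    X = np.asarray(X)
    Y = np.asarray(Y).astype(int)
    # shared[i, j] = number of labels that points i and j have in common
    shared = Y @ Y.T
    # Enumerate every unordered pair of distinct points (quadratic in n_samples)
    idx = np.array(list(combinations(range(len(X)), 2)))
    pairs = X[idx]
    # +1 for "similar" (enough shared labels), -1 for "dissimilar"
    y_pairs = np.where(shared[idx[:, 0], idx[:, 1]] >= min_shared, 1, -1)
    return pairs, y_pairs


X = np.array([[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [0.9, 1.1]])
Y = np.array([[1, 1, 0], [1, 0, 0], [0, 0, 1], [0, 1, 1]])
pairs, y_pairs = make_pairs_from_multilabels(X, Y)
print(pairs.shape)  # (6, 2, 2)
print(y_pairs)      # [ 1 -1  1 -1 -1  1]
```

The output follows the 3D pairs array and +1/-1 labels that the pairs learners in metric-learn take in `fit(pairs, y)`, so it should be usable with one of them as-is. Other strategies (a Jaccard threshold on the label sets, or sampling pairs instead of enumerating all of them) would only change how `y_pairs` and `idx` are computed.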

Feb 26 '19 16:02 wdevazelhes

Could you share the link to NCA PR in scikit-learn? Are you reusing what’s available in metric-learn?

Feb 26 '19 16:02 terrytangyuan

> Could you share the link to NCA PR in scikit-learn?

Sure, here is the link: https://github.com/scikit-learn/scikit-learn/pull/10058

> Are you reusing what’s available in metric-learn?

In fact, I reused much of a PR about LMNN (https://github.com/scikit-learn/scikit-learn/pull/8602) for the architecture of the code, and just replaced its loss function with NCA's. This PR is quite thorough regarding error messages, parameter checks, automatic initialization, etc., so I guess we could bring some of these developments into metric-learn (that's already what we did in some PRs like #113, #105, and #99).

Feb 26 '19 16:02 wdevazelhes

Btw, the PR was recently merged into scikit-learn! :tada:

Mar 08 '19 08:03 wdevazelhes

Nice job! Congrats!

Mar 08 '19 15:03 terrytangyuan

Thanks!

Mar 08 '19 15:03 wdevazelhes

Also worth mentioning that it was @GaelVaroquaux who originally suggested investigating the multi-label setting ;-)

Mar 13 '19 18:03 bellet

Looking forward to this development if it is still a thing

Mar 30 '21 23:03 angelotc
