ml4ir icon indicating copy to clipboard operation
ml4ir copied to clipboard

Loss and evaluation metrics for graded relevance labeled records (for public datasets)

Open mohazahran opened this issue 5 years ago • 2 comments

Few public ranking datasets are labeled with graded relevance (score from 1 to 5) rather than clicks. Which means the target prediction is not necessarily binary vector. This means MRR is not suitable as an evaluation metric (NDCG is more natural in this case). Also, a special attention needs to be taken for the loss function as well. @lastmansleeping @jakemannix

mohazahran avatar Jul 30 '20 20:07 mohazahran

Perfect. Let's prioritize adding these. @mohazahran

lastmansleeping avatar Jul 30 '20 20:07 lastmansleeping

This issue has been linked to a new work item: W-7983051

git2gus[bot] avatar Aug 19 '20 18:08 git2gus[bot]