PairCNN-Ranking

How to solve this problem: the number of negative instances is far greater than the number of positive instances

Open yuye2133 opened this issue 7 years ago • 4 comments

In a ranking task, one query may have many documents. If each <query, document_(i)> pair is labeled 0 or 1, the 0 labels will far outnumber the 1 labels, and the trained model then does not perform well on the test set.

yuye2133 • Apr 09 '17 14:04

@yuye2133 Actually your problem is label imbalance. You should try some strategies to deal with that.

zhangzibin • Apr 10 '17 02:04

But don't all ranking problems of this kind have this imbalance? How do you deal with it?

yuye2133 • Apr 10 '17 12:04

Same question here. My understanding is that the model should score each question and then rank by those scores.

gallupliu • Jul 07 '17 02:07

@yuye2133 @gallupliu I usually use resampling. The best strategy depends on the problem.

zhangzibin • Feb 28 '18 00:02
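
For readers wondering what the resampling mentioned above might look like in practice, here is a minimal sketch of per-query negative downsampling. It is not taken from PairCNN-Ranking and is not necessarily what the maintainer does. The function name `downsample_negatives`, the `(query_id, doc_id, label)` tuple layout, and the `neg_per_pos` ratio are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def downsample_negatives(examples, neg_per_pos=1, seed=0):
    """Keep all positives and at most `neg_per_pos` negatives per positive, per query.

    `examples` is an iterable of (query_id, doc_id, label) tuples with
    label in {0, 1}. This layout is hypothetical, chosen for the sketch.
    """
    rng = random.Random(seed)

    # Group the pairs by query, separating negatives (0) from positives (1).
    by_query = defaultdict(lambda: {0: [], 1: []})
    for query_id, doc_id, label in examples:
        by_query[query_id][label].append((query_id, doc_id, label))

    resampled = []
    for groups in by_query.values():
        positives, negatives = groups[1], groups[0]
        if not positives:
            # Assumption: a query with no positive document is skipped.
            continue
        # Never ask for more negatives than the query actually has.
        k = min(len(negatives), neg_per_pos * len(positives))
        resampled.extend(positives)
        resampled.extend(rng.sample(negatives, k))

    rng.shuffle(resampled)
    return resampled
```

Sampling within each query, rather than over the whole dataset, keeps every query with a positive represented in the training set. Whether downsampling negatives, oversampling positives, or reweighting the loss works best depends on the problem, as noted above.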