examples Simpler or current model I should use to predict probabilities?

Simpler or current model I should use to predict probabilities?

Open Haoyoudoing opened this issue 7 months ago • 1 comments

Thanks for publishing such a great project for finding data issues. After reviewing some of the examples, I would like to hear your guidance for the following situation:

How to find human annotators' error labels during active learning to fine-tune a sentence transformer model for a text classification task. Should I use a simpler model, i.e. a logistic regression model, to generate the probabilities for confident learning, or should I use the current fine-tuned sentence transformer to do the job? Will this make a big difference?

Jul 10 '24 20:07 Haoyoudoing

examples examples copied to clipboard

Simpler or current model I should use to predict probabilities?

examples
examples copied to clipboard