
GPL with a low-performing CE

Open IliasAarab opened this issue 1 year ago • 0 comments

Does it make sense to train a model with GPL when the cross-encoder (CE) used for pseudo-labeling performs poorly on the domain dataset (i.e., using the CE directly for IR tasks on that dataset gives poor results)? I would expect the GPL-trained model to perform poorly as well, since the CE's performance is the upper bound on what GPL can achieve.

If my reasoning is correct, is there a way to deal with this shortcoming?
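
For context, the direct CE check mentioned above (using the CE as a reranker on the domain dataset) can be sketched roughly as below. This is only an illustration under stated assumptions: `mrr_at_k`, `overlap_score`, and the toy data are hypothetical names made up here, not part of GPL, and the token-overlap scorer is just a stand-in for a real cross-encoder.

```python
from typing import Callable, Dict, List, Set


def mrr_at_k(
    queries: Dict[str, str],
    passages: Dict[str, str],
    candidates: Dict[str, List[str]],
    relevant: Dict[str, Set[str]],
    score_fn: Callable[[str, str], float],
    k: int = 10,
) -> float:
    """Rerank each query's candidates with score_fn and return MRR@k."""
    total = 0.0
    for qid, query in queries.items():
        # Sort candidate passage ids by descending score for this query.
        ranked = sorted(
            candidates[qid],
            key=lambda pid: score_fn(query, passages[pid]),
            reverse=True,
        )
        # Add the reciprocal rank of the first relevant passage in the top k.
        for rank, pid in enumerate(ranked[:k], start=1):
            if pid in relevant[qid]:
                total += 1.0 / rank
                break
    return total / len(queries)


def overlap_score(query: str, passage: str) -> float:
    """Hypothetical stand-in scorer: fraction of query tokens found in the passage.

    With a real CE one would score (query, passage) pairs with the model instead,
    e.g. via sentence-transformers' CrossEncoder.predict.
    """
    q_tokens = set(query.lower().split())
    p_tokens = set(passage.lower().split())
    return len(q_tokens & p_tokens) / max(len(q_tokens), 1)


# Toy domain data, invented purely for illustration.
queries = {"q1": "reset router password", "q2": "battery life of the laptop"}
passages = {
    "p1": "How to reset the password on your router",
    "p2": "Laptop battery life tips",
    "p3": "Cooking pasta at home",
}
candidates = {"q1": ["p3", "p1", "p2"], "q2": ["p1", "p3", "p2"]}
relevant = {"q1": {"p1"}, "q2": {"p2"}}

score = mrr_at_k(queries, passages, candidates, relevant, overlap_score, k=10)
print(f"MRR@10 of the stand-in scorer: {score:.3f}")
```

A low score from this kind of check on held-out domain queries is what I mean by the CE being a bad performer on the domain dataset.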

IliasAarab, Jun 13 '23 16:06