loss_dropper

Is there any way to apply this work to a pretrained model (e.g. BART, T5)?

Open ElderWanng opened this issue 4 years ago • 1 comment

I'm really interested in your great work. Just curious: is it possible to combine BART with loss truncation? The vanilla LSTM with attention is somewhat out of date.

ElderWanng, Oct 19 '21 11:10

Hi @ElderWanng, you can take a pretrained model and continue training it with loss truncation; we find it works quite well.

ddkang, Oct 19 '21 16:10
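
For readers wondering what "continue training with loss truncation" might look like in practice, here is a minimal, framework-agnostic sketch of the masking step. It is not the loss_dropper API and not the authors' implementation: `TruncationMask`, `truncated_mean`, and all parameter names are hypothetical. It assumes you can get one loss value per example from the pretrained model (for BART or T5, for instance, by computing the token-level cross-entropy without reduction and averaging over each sequence), and that examples whose loss falls in the highest fraction seen so far are simply excluded from the update.

```python
from collections import deque


class TruncationMask:
    """Hypothetical helper: masks out the highest-loss fraction of examples."""

    def __init__(self, drop_fraction=0.4, history_size=1000, min_history=1):
        if not 0.0 <= drop_fraction < 1.0:
            raise ValueError("drop_fraction must be in [0, 1)")
        self.drop_fraction = drop_fraction
        # Rolling window of recent per-example losses, used to estimate a quantile.
        self.history = deque(maxlen=history_size)
        self.min_history = max(min_history, 1)

    def threshold(self):
        """Return the cutoff loss, or None while the history is still too short."""
        if len(self.history) < self.min_history:
            return None
        ordered = sorted(self.history)
        # Position of the (1 - drop_fraction) quantile in the sorted history.
        idx = min(int(len(ordered) * (1.0 - self.drop_fraction)), len(ordered) - 1)
        return ordered[idx]

    def __call__(self, per_example_losses):
        """Return a 0/1 mask for this batch, then record its losses."""
        cutoff = self.threshold()  # estimated from batches seen before this one
        self.history.extend(per_example_losses)
        if cutoff is None:
            # Warm-up: keep everything until enough losses have been observed.
            return [1.0] * len(per_example_losses)
        return [1.0 if loss <= cutoff else 0.0 for loss in per_example_losses]


def truncated_mean(per_example_losses, mask):
    """Average the loss over kept examples only (a choice made for this sketch)."""
    kept = sum(mask)
    if kept == 0:
        return 0.0
    return sum(l * m for l, m in zip(per_example_losses, mask)) / kept


# Toy usage with made-up loss values standing in for a fine-tuning run.
dropper = TruncationMask(drop_fraction=0.5, history_size=10, min_history=4)
first_mask = dropper([1.0, 2.0, 3.0, 4.0])    # warm-up batch, nothing dropped
second_mask = dropper([0.5, 3.5, 10.0, 2.5])  # high-loss examples masked out
batch_loss = truncated_mean([0.5, 3.5, 10.0, 2.5], second_mask)
```

In a real fine-tuning loop the same mask would be multiplied into the per-example loss tensor before calling backward, so the pretrained weights are only updated on the examples that were kept. The warm-up period and the window size are assumptions of this sketch and would need tuning.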