gpt-2-output-dataset
gpt-2-output-dataset copied to clipboard
why Roberta?
trafficstars
Why did you use Roberta and not use BERT or ELMO instead?
In an ablation study (that we didn't publish) we found that RoBERTa fine-tunes better than BERT or GPT-2 itself. We expect ELECTRA should work as well.