Diffusion-BERT
Diffusion-BERT copied to clipboard
How to fine-tune it
It seems that a pre-trained language model. Could i run train on a lot of unconditional text to get a checkpoint then fine-tuning the model on Seq2seq tasks?
Sure! (Continue) pre-training and finetuning with the diffusion objective are both supported.