cpystan
We found that the checkpoint we uploaded is incorrect and should not be used for testing. We apologize for the mistake and will upload a new checkpoint later.
As with a standard Transformer, you need to feed 'text' to the decoder during training.
For inference, the special token [START] is fed to the decoder first, and generation then proceeds autoregressively.
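To illustrate the difference between the two modes, here is a minimal sketch in plain Python. It is not the repository's code: the function names, the `[END]` token, and the `toy_next_token` stand-in for the decoder are all assumptions made for illustration. Only the `[START]` token and the idea of feeding 'text' to the decoder come from the comment above.

```python
from typing import Callable, List, Tuple

START = "[START]"  # special token mentioned above
END = "[END]"      # assumed end-of-sequence token (hypothetical)


def teacher_forcing_pairs(text_tokens: List[str]) -> Tuple[List[str], List[str]]:
    """Training: the decoder input is the ground-truth text shifted right by one."""
    decoder_input = [START] + text_tokens
    target = text_tokens + [END]
    return decoder_input, target


def greedy_decode(next_token: Callable[[List[str]], str], max_len: int = 20) -> List[str]:
    """Inference: start from [START] and feed each predicted token back in."""
    generated = [START]
    for _ in range(max_len):
        token = next_token(generated)
        if token == END:
            break
        generated.append(token)
    return generated[1:]  # drop the leading [START]


def toy_next_token(prefix: List[str]) -> str:
    """Hypothetical stand-in for the real decoder: replays a canned sequence."""
    canned = ["a", "short", "report", END]
    return canned[len(prefix) - 1]


if __name__ == "__main__":
    print(teacher_forcing_pairs(["a", "short", "report"]))
    print(greedy_decode(toy_next_token))
```

In a real model, `next_token` would run the decoder on the tokens generated so far (plus the encoder features) and pick the most likely next token.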