cosFormer
cosFormer copied to clipboard
Pre-train model
In the paper,it mentioned that the work of the bidirectional language modeling pre-train has been done. Are you planning on releasing some pre-trained weights for the model?