sonnet
sonnet copied to clipboard
Is there any compressive transformers pre-trained model releasing plan?
Hi, thx for your guys' great works. Is there any compressive transformers pre-trained model releasing plan? Want to see CT as backbone of GPT2 or BERT on long document like Longformer(a new sparse attention transformer variant)