Stratified-Transformer
How to fine-tune Stratified-Transformer
Which layers should I freeze, and which should I leave trainable, when fine-tuning the model?
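To make the question concrete, here is a minimal PyTorch sketch of what I mean by freezing. It is only an illustration under stated assumptions: `ToySegmenter`, its `backbone` and `classifier` attributes, and the helper `freeze_all_but` are hypothetical names, not the actual module names in Stratified-Transformer. The real parameter names would need to be checked with `model.named_parameters()`.

```python
import torch
from torch import nn


class ToySegmenter(nn.Module):
    """Hypothetical stand-in model: a feature backbone plus a per-point classifier."""

    def __init__(self, in_dim=6, hidden=32, num_classes=13):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.classifier = nn.Linear(hidden, num_classes)

    def forward(self, x):
        return self.classifier(self.backbone(x))


def freeze_all_but(model, trainable_prefixes):
    """Set requires_grad=True only for parameters whose name starts with a given prefix.

    Returns the names of the parameters left trainable.
    """
    prefixes = tuple(trainable_prefixes)
    for name, param in model.named_parameters():
        param.requires_grad = name.startswith(prefixes)
    return [n for n, p in model.named_parameters() if p.requires_grad]


model = ToySegmenter()
trainable = freeze_all_but(model, ["classifier"])  # assumption: only the head is retrained

# Hand the optimizer only the parameters that still require gradients.
optimizer = torch.optim.AdamW(
    [p for p in model.parameters() if p.requires_grad], lr=1e-4
)
```

Passing a longer list of prefixes would unfreeze more of the network. Which prefixes are the right ones for this model is exactly what I am asking.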