Pretraining DeBERTa-v3 with a larger context length

Open sherlcok314159 opened this issue 1 year ago • 2 comments

Hi! I see that DeBERTa-v3 uses relative position embeddings, so it can take in a longer context than traditional BERT. Have you tried pretraining DeBERTa-v3 with a context length of 1024 or larger?

If I need to pretrain DeBERTa-v3 from scratch with a larger context length (e.g., 1024), are there any modifications I should make besides the training script?
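To make the question concrete, here is a minimal pure-Python sketch of the log-bucketed relative positions that, as far as I can tell, DeBERTa-v2/v3 implementations use. The function name `log_bucket_position` and the scalar form are my own; the formula is modeled on my reading of the bucketing helper in the Hugging Face DeBERTa-v2 code, and the values `bucket_size=256` and `max_position=512` are assumptions taken from what I believe the published deberta-v3-base config uses. Please treat all of this as an illustration, not the reference implementation.

```python
import math


def log_bucket_position(relative_pos: int, bucket_size: int, max_position: int) -> int:
    """Map a signed token distance to a signed bucket index (scalar sketch)."""
    mid = bucket_size // 2
    # Distances up to half the bucket size keep their exact value.
    if abs(relative_pos) <= mid:
        return relative_pos
    # Larger distances are compressed on a log scale, so that a distance of
    # max_position - 1 lands on the last bucket (bucket_size - 1).
    scale = math.log(abs(relative_pos) / mid) / math.log((max_position - 1) / mid)
    log_pos = math.ceil(scale * (mid - 1)) + mid
    return log_pos if relative_pos > 0 else -log_pos


if __name__ == "__main__":
    # Compare how the same distances are bucketed for two context lengths.
    for max_position in (512, 1024):
        buckets = [log_bucket_position(d, 256, max_position) for d in (5, 200, 511)]
        print(max_position, buckets)
```

If this reading is right, the number of buckets stays the same when the context grows, and only the log scale stretches, so the main knob would be the maximum position rather than the size of the relative embedding table.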

Thanks for any kind help!

sherlcok314159 · Jul 12 '24 03:07

Hi, I did a multi-task fine-tune with a context length of 1280 (1680 for the small version): https://huggingface.co/tasksource/deberta-base-long-nli

sileod · Sep 15 '24 14:09

Could you please open-source your code so that I can learn from it?

sherlcok314159 · Sep 23 '24 08:09