KG-BART
KG-BART copied to clipboard
How to use the pretrained model to finetune for bigger documents ??
I see the max_seq_length and max_position is set to 32 and 64. Currently, when I was trying to do it with bigger size I was getting OOM even for batch size 1 I don't understand max_len_a , max_len_b and max_pred ? Afaik these aren't specific to the BART paper