adapters icon indicating copy to clipboard operation
adapters copied to clipboard

pretrained Deberta models fail to load in 3.1

Open sanderland opened this issue 2 years ago • 0 comments

Environment info

  • adapter-transformers version: master / 3.1a0

Information

Model I am using: Deberta

When loading it using model = AutoAdapterModel.from_pretrained("microsoft/deberta-base")

It returns a size mismatch for deberta.encoder.layer.[n].attention.self.in_proj.weight: copying a param with shape torch.Size([2304, 768]) from checkpoint, the shape in current model is torch.Size([768, 2304]). Same for "HannahRoseKirk/Hatemoji"

Additional tests

  • deberta-v3 (=V2 in implementation) works
  • adapter-transformers 3.0.1 does not have deberta

sanderland avatar Jul 27 '22 13:07 sanderland