Mergekit support for GPT2 failing

Open NamburiSrinath opened this issue 1 year ago • 2 comments

Hi @cg123,

Great library, thanks a lot, super useful!

I've finetuned GPT2 on two tasks (giving model1 and model2) and am trying to merge them using your repo.

It turns out that using the original model as the base throws one error or another (more details in the issue I raised: https://github.com/huggingface/transformers/issues/29184).

But using one of the finetuned models as the base model (base_model: model1) works fine. Is this expected?
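
To make the two setups concrete, here is a rough sketch of the configs being compared, written as Python dicts that mirror the layout of a mergekit YAML config. Only `base_model: model1` comes from the description above; the merge method (`task_arithmetic`), the weights, the dtype, the local paths, and the `build_merge_config` helper are all assumptions for illustration, so treat this as a sketch rather than the exact config used.

```python
# Hypothetical sketch: paths, merge method, weights, and dtype are
# placeholders, not values taken from the actual setup.

def build_merge_config(base_model, finetuned, merge_method="task_arithmetic"):
    """Return a dict shaped like a mergekit YAML config."""
    return {
        "merge_method": merge_method,   # assumed; any base-model method would do
        "base_model": base_model,
        "models": [
            {"model": path, "parameters": {"weight": 1.0}}  # assumed weights
            for path in finetuned
        ],
        "dtype": "float32",             # assumed
    }

finetuned = ["./model1", "./model2"]    # hypothetical local paths

# Case 1: original model as the base (the setup that raises errors).
original_base = build_merge_config("gpt2", finetuned)

# Case 2: one of the finetuned models as the base (the setup that works).
finetuned_base = build_merge_config("./model1", finetuned)

print(original_base["base_model"], finetuned_base["base_model"])
```

The only field that differs between the two cases is `base_model`, which is what makes the difference in behaviour surprising.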

Thanks a ton, looking forward to your reply.

NamburiSrinath · Feb 22 '24 03:02