MiniGPT-4 Is there a first pretraining stage model for Llama 7b?

Is there a first pretraining stage model for Llama 7b?

Open johnmullan opened this issue 1 year ago • 3 comments

Or should we just carry on from the existing checkpoint?

Thanks

Sep 28 '23 13:09 johnmullan

I have the same question, and everyone can also take a look at this discussion

Oct 10 '23 04:10 slacklife

i have the same question, have u resolved it ?

Nov 29 '23 07:11 terminator123

Same question, I use the provided 7B MiniGPT-4 checkpoint for the second FT with Llama2, but it gives error size mismatch for llama_proj.weight: copying a param with shape torch.Size([4096, 768]) from checkpoint, the shape in current model is torch.Size([4096, 5632]). However, it works fine when I use the ckpt for second FT with Vicuna 7B, may be this ckpt is for Vicuna 7B only. Could you please provide the first stage ckpt that works for Llama2. Thank you very much!

Dec 28 '23 05:12 zhangwengyu999

MiniGPT-4 MiniGPT-4 copied to clipboard

Is there a first pretraining stage model for Llama 7b?

MiniGPT-4
MiniGPT-4 copied to clipboard