MiniGPT-4

Is there a first pretraining stage model for Llama 7b?

Open johnmullan opened this issue 1 year ago • 3 comments

Or should we just carry on from the existing checkpoint?

Thanks

johnmullan avatar Sep 28 '23 13:09 johnmullan

I have the same question. Everyone can also take a look at this discussion.

slacklife avatar Oct 10 '23 04:10 slacklife

I have the same question. Have you resolved it?

terminator123 avatar Nov 29 '23 07:11 terminator123

Same question. I used the provided 7B MiniGPT-4 checkpoint for the second-stage fine-tuning with Llama2, but it fails with a size mismatch error for llama_proj.weight: copying a param with shape torch.Size([4096, 768]) from checkpoint, the shape in current model is torch.Size([4096, 5632]). However, it works fine when I use the same checkpoint for the second-stage fine-tuning with Vicuna 7B, so maybe this checkpoint is for Vicuna 7B only. Could you please provide a first-stage checkpoint that works with Llama2? Thank you very much!

zhangwengyu999 avatar Dec 28 '23 05:12 zhangwengyu999
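
For anyone hitting the same error, it may help to list every parameter whose shape differs between the checkpoint and the model before loading. A PyTorch linear layer stores its weight as (out_features, in_features), so in the error above the output width (4096) matches and only the input width differs (768 vs 5632). That suggests the features fed into llama_proj differ between the two setups, rather than the language model's hidden size. The sketch below is only an illustration: find_shape_mismatches is a hypothetical helper, not part of MiniGPT-4, and the two dictionaries contain just the shapes quoted in the error message.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    ckpt_shapes: Dict[str, Shape],
    model_shapes: Dict[str, Shape],
) -> List[Tuple[str, Shape, Shape]]:
    """Return (name, checkpoint_shape, model_shape) for every parameter
    that exists in both mappings but with a different shape."""
    mismatches = []
    for name, ckpt_shape in ckpt_shapes.items():
        model_shape = model_shapes.get(name)
        # Parameters missing from the model are skipped, not reported.
        if model_shape is not None and tuple(model_shape) != tuple(ckpt_shape):
            mismatches.append((name, tuple(ckpt_shape), tuple(model_shape)))
    return mismatches


# Shapes copied from the error message in the comment above.
# With real objects, the mappings could be built from a state dict, e.g.
#   {k: tuple(v.shape) for k, v in state_dict.items()}
# (where the checkpoint keeps its state dict is an assumption to verify).
ckpt_shapes = {"llama_proj.weight": (4096, 768)}
model_shapes = {"llama_proj.weight": (4096, 5632)}

mismatches = find_shape_mismatches(ckpt_shapes, model_shapes)
for name, ckpt_shape, model_shape in mismatches:
    print(f"{name}: checkpoint {ckpt_shape} vs model {model_shape}")
```

If llama_proj.weight is the only entry reported, the rest of the checkpoint is shape-compatible and the difference is confined to the projection layer's input.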