
[Question] Finetune Stage 2 Model

Open Xuefei98 opened this issue 10 months ago • 3 comments

Question

First of all, great work, and thank you so much for open-sourcing it! Has the stage 2 model (referred to as ViP-LLaVA-Base) been released anywhere? Maybe mucai/vip-llava-13b-pretrain? I am trying to finetune the stage 2 model using custom GPT instruction data. I am looking at scripts/finetune_stage3.sh and wonder if that's the correct script. However, the model used in the script is ./checkpoints/vip-llava-$model_size-stage2-ft, and I don't see it anywhere. Thank you!

Xuefei98 avatar Apr 18 '24 16:04 Xuefei98

Hi xuefei,

Thanks for bringing this up! I just uploaded the 7B stage 2 model: https://huggingface.co/mucai/vip-llava-7b-base

Mu

mu-cai avatar Apr 19 '24 12:04 mu-cai

Hi Mu,

Thank you so much for getting back to me! Could you also share the 13B model? I would like to fine-tune both the 7B and 13B models and compare their performance in my experiments.

Xuefei

Xuefei98 avatar Apr 19 '24 13:04 Xuefei98

You can now find the 13B base model here! https://huggingface.co/mucai/vip-llava-13b-base

mu-cai avatar Apr 22 '24 22:04 mu-cai
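
For anyone landing here later: a minimal sketch of how the two uploaded base models could be fetched into the directory that scripts/finetune_stage3.sh points at. The repo IDs come from the links above. The rest is assumption, not something confirmed in this thread: that $model_size takes the values 7b or 13b, that the uploaded base checkpoints are what the script expects under the stage2-ft name, and the helper names plan_base_checkpoint and download_base_checkpoint, which are hypothetical. It uses snapshot_download from the huggingface_hub package.

```python
from pathlib import Path

# Repo IDs taken from the links posted in this thread.
BASE_REPOS = {
    "7b": "mucai/vip-llava-7b-base",
    "13b": "mucai/vip-llava-13b-base",
}


def plan_base_checkpoint(model_size, root="./checkpoints"):
    """Return (repo_id, local_dir) for a given model size.

    Assumption: the directory name mirrors the path used in
    scripts/finetune_stage3.sh, i.e. vip-llava-$model_size-stage2-ft.
    """
    size = model_size.lower()
    if size not in BASE_REPOS:
        raise ValueError(f"unknown model size: {model_size!r}")
    return BASE_REPOS[size], Path(root) / f"vip-llava-{size}-stage2-ft"


def download_base_checkpoint(model_size, root="./checkpoints"):
    """Download the base model into the planned directory (needs network)."""
    # Imported lazily so the path logic above works without the package.
    from huggingface_hub import snapshot_download

    repo_id, local_dir = plan_base_checkpoint(model_size, root)
    snapshot_download(repo_id=repo_id, local_dir=str(local_dir))
    return local_dir
```

Calling download_base_checkpoint("7b") from the repository root should leave the weights under ./checkpoints/vip-llava-7b-stage2-ft. If the script's expected path differs in your checkout, editing the path in the script is an equally reasonable route.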