FastChat icon indicating copy to clipboard operation
FastChat copied to clipboard

Can I further fine-tune on the Vicuna-13B base?

Open theslugger opened this issue 2 years ago • 1 comments

theslugger avatar Apr 14 '23 07:04 theslugger

AFAIK, In terms of fine-tuning, there should be no difference from starting from the LLaMA base and vicuna 13b fine tuned models.

musabgultekin avatar Apr 15 '23 15:04 musabgultekin

@theslugger Yes you can. You can either start from the LLama 13b or Vicuna 13B.

There indeed is a difference on which you start fine-tuning. I believe starting from Vicuna at least make sure the model has seen vicuna data before. Closing this ticket. Good luck fine-tuning yourself!

zhisbug avatar Apr 21 '23 04:04 zhisbug

@zhisbug If we start from Vicuna 13B, do we need to modify the format of the dummy.json to have the system prompt?

karan-dalal avatar Jun 05 '23 03:06 karan-dalal