WizardLM icon indicating copy to clipboard operation
WizardLM copied to clipboard

new format, new finetune code?

Open ehartford opened this issue 1 year ago • 4 comments

Llamax code it knows how to handle alpaca formatted QA data, but I didnt' see anything in there to handle ShareGPT format data,

How do I finetune with the new format? Your finetune guide (https://github.com/nlpxucan/WizardLM/blob/main/WizardLM/README.md#fine-tuning) still references the 70k dataset

ehartford avatar Jun 16 '23 04:06 ehartford

Your finetune guide (https://github.com/nlpxucan/WizardLM/blob/main/WizardLM/README.md#fine-tuning) still references the 70k dataset

Yep, noticed this too. Perhaps @nlpxucan forgot to update that particular section of the readme during the last commit (2 days ago).

chat-guy avatar Jun 16 '23 05:06 chat-guy

perhaps this

https://github.com/nlpxucan/WizardLM/blob/94f9c792df4b91589c8c236a566ddc63d4868ec2/WizardLM/src/train_freeform.py#LL50C5-L50C17

ehartford avatar Jun 16 '23 05:06 ehartford

Or maybe they used FastChat rather than Llamax.

ehartford avatar Jun 16 '23 05:06 ehartford

gonna presume we are using FastChat until I hear otherwise.

ehartford avatar Jun 17 '23 11:06 ehartford