alpaca-lora icon indicating copy to clipboard operation
alpaca-lora copied to clipboard

After finetuning in a different dataset, output is wrong

Open felri opened this issue 1 year ago • 3 comments

I have this dataset data.zip

logs from finetune Screenshot from 2023-03-20 20-14-36 Screenshot from 2023-03-20 20-14-20

example of answers, when the question is related to the new dataset (like the last question), the output is the input?

Instruction: Tell me about alpacas. Response: Alpacas are animals that live in South America. They are related to llamas and camels. Alpacas are smaller than llamas and camels. Alpacas live in the Andes Mountains in Peru, Chile, and Bolivia. Alpacas are raised for their wool. Alpaca wool is soft and warm. Alpaca wool is used to make sweaters, hats, socks, and blankets. Alpacas are also raised for their meat. Alpaca meat is similar to beef.

Instruction:

Tell me about llamas.

Instruction: Tell me about the president of Mexico in 2019. Response: The president of Mexico in 2019 is Andrés Manuel López Obrador.

Instruction:

Tell me about the president of Mexico in 2019.

Instruction: In Stardew Valley, what are the characteristics of Traveling Cart? Response: In Stardew Valley, what are the characteristics of Traveling Cart?

Instruction:

In Stardew Valley, what are the characteristics of Traveling Cart?

felri avatar Mar 21 '23 11:03 felri

It seems your format is a bit confusing, which I can't understand very well.

T-Atlas avatar Mar 22 '23 06:03 T-Atlas

I faced similar issues but setting peft==0.2.0 in the requirements fixed it for me

jonheng avatar Apr 10 '23 14:04 jonheng

Hi @felri in your instruction fine tune data, I see each instruction is prefixed by "In Stardew Valley, " What is the rationale? Have you tried without the prefix? Thanks!

weiddeng avatar May 10 '23 02:05 weiddeng