llama.cpp About dialogue training mode

For the finetune, the default is use text dataset（e.g. shakespeare.txt）, does it support dialogue training mode？

Apr 30 '24 02:04 andyyumiao

It does not support a specific mode for dialogue training such a thing does not exist to my knowledge. Instruction tuning and RLHF could loosely be described as dialogue training.

Consider if your issue is an issue that is relevant to the development llama.cpp.

If it is not. I recommend reading up on database creation, and instruction tuning, here is a technical paper. There are also plenty of resources and guides on youtube and around the internet.

If it really is an issue relevant to the development of llama.cpp, please explain it in much greater detail. What do you want to achieve? What have you done so far to achieve it?

May 14 '24 23:05 arnfaldur

It does not support a specific mode for dialogue training such a thing does not exist to my knowledge. Instruction tuning and RLHF could loosely be described as dialogue training.

Consider if your issue is an issue that is relevant to the development llama.cpp.

If it is not. I recommend reading up on database creation, and instruction tuning, here is a technical paper. There are also plenty of resources and guides on youtube and around the internet.

If it really is an issue relevant to the development of llama.cpp, please explain it in much greater detail. What do you want to achieve? What have you done so far to achieve it?

Yes, actually I mean the instruction tuning. Is there a demo for it on llama.cpp?

May 28 '24 00:05 andyyumiao

Not to my knowledge or expectations. This is not the place to ask about this either. I recommend doing some research about what you're trying to do, elsewhere. There are plenty of good introductory articles, and videos on youtube, if you prefer.

May 29 '24 10:05 arnfaldur

llama.cpp llama.cpp copied to clipboard

About dialogue training mode

llama.cpp
llama.cpp copied to clipboard