LMFlow icon indicating copy to clipboard operation
LMFlow copied to clipboard

What's the difference between these two finetune methods for enhanced datasets?

Open intothephone opened this issue 2 years ago • 1 comments

AFAIK,there're two methods to continue finetuning based on the previously finetuned model:

  1. (base model + lora model + dataset1) -> (base model + finetuned model + dataset2)
  2. (base model + lora model + dataset1) -> (full finetuned model + lora model + dataset2)

here "full finetuned model" in the second method means the training result of using the script "run_finetune_with_lora_save_aggregated_weights.sh"

If dataset2 has better quality, which method should I use? Thanks in advance.

intothephone avatar Apr 22 '23 12:04 intothephone

Hi, Currently, we are also curious about the performance of these two methods. It may need more comprehensive experiments to explore. If you have time, feel free to investigate this by experiments and welcome to share results with us.

Thank you!

shizhediao avatar Apr 22 '23 17:04 shizhediao

This issue has been marked as stale because it has not had recent activity. If you think this still needs to be addressed please feel free to reopen this issue. Thanks

shizhediao avatar May 15 '23 00:05 shizhediao