hoshi-hiyouga
ping @mlinmg
> No problem, I'll do it tomorrow. Also, please send the error it gives you; I can't manage to reproduce it with this script:
>
> ```
> CUDA_VISIBLE_DEVICES=0 python src/train_bash.py \
>     --stage sft \...
> ```
There are too many dummy commits in this PR; opening a new one would be more efficient. Btw, Qwen1.5 models do not work for me in either the non-FA2 or the FA2 path.
It may be because the embed tokens and lm head are saved in 32-bit precision, leading to an increase in file size. You can merge the LoRA adapter using...
Update the code and select `cuda` as the export device.
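For context, a merge/export config along these lines is what selects the export device; the key names below are assumptions based on LLaMA-Factory's merge-LoRA examples, so verify them against the repo's own example configs before use:

```yaml
# Hypothetical merge/export config sketch; key names assumed,
# check LLaMA-Factory's merge-LoRA examples for the exact schema.
model_name_or_path: path/to/base_model
adapter_name_or_path: path/to/lora_adapter
template: default
export_dir: path/to/merged_model
export_device: cuda   # export on cuda instead of cpu, as suggested above
```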
in 3 days
Now we provide the `empty` template, which allows training on text without a specific format. You can pre-process your dataset with your own template and then use the `empty` template to fine-tune...
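As a minimal sketch of what "pre-process your dataset with your own template" could look like: bake your chat format into plain text yourself, then let the `empty` template train on it as-is. The `### Question:` / `### Answer:` format and the field names below are illustrative assumptions, not a schema prescribed by LLaMA-Factory.

```python
# Illustrative pre-processing: render each instruction/response pair into
# a single plain-text field so the `empty` template adds no extra format.
import json

def to_plain_text(example: dict) -> dict:
    """Apply a custom format (assumed here) and return a plain-text sample."""
    text = (
        f"### Question:\n{example['instruction']}\n"
        f"### Answer:\n{example['output']}"
    )
    # Put the fully templated text in the prompt field; the target is
    # left empty because the text already contains the answer.
    return {"instruction": text, "output": ""}

samples = [{"instruction": "What is 2 + 2?", "output": "4"}]
templated = [to_plain_text(s) for s in samples]
print(json.dumps(templated, indent=2))
```

The resulting JSON can then be registered as a dataset and fine-tuned with the `empty` template, so no additional formatting is injected at training time.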
Sorry, it is not supported yet.
The Llama-3 model can generate text in our Linux environment, so I think it is likely an issue with your hardware or environment. ![image](https://github.com/hiyouga/LLaMA-Factory/assets/16256802/fa5f2dde-1fc2-45c0-84e5-cf1cfe89f736)
@JieShenAI Not supported yet.