Ash

Results 7 issues of Ash

### Is there an existing issue for this? - [ ] I have searched the existing issues ### Current Behavior 搜集了50万条的中文语料,语料的格式为:{"instruction": "输入一个数,判断它是否是质数。 \n数字: 47", "input": "", "output": "是质数。"},如果取50万条中的前1万条进行微调,加载微调后的模型进行推理,可以得到回复,但是加载用全部数据微调的模型进行推理,却不能给出回复,请问导致这个问题的原因是什么,又该如何解决呢? ### Expected...

Excuse me, I have a few questions to ask,and I am loking forward to your answer: I use passthrough and slerp to merge qwen14B, here is my passthrough yaml: ```yaml...

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction ![image](https://github.com/hiyouga/LLaMA-Factory/assets/33438938/fe471ca3-5477-4be0-9916-ffb8a6f7eddc) 请问如何解决 ### Expected behavior _No response_ ### System Info _No response_ ### Others...

pending

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction 项目更新后,发现程序启动方式变为 llamafactory-cli xxx 这样的方式,老实说这样的方式体验感很糟,不能定位到底运行了哪个文件,不如之前的python src/... 这样的方式清晰,容易定位文件,运行报错时,也容易追踪和定位。请考虑是否改回来。 ### Expected behavior _No response_ ### System Info...

### Reminder - [X] I have read the README and searched the existing issues. ### System Info 论文地址:https://arxiv.org/abs/2402.14740 huggingface trl 库实现地址:https://github.com/huggingface/trl/blob/main/trl/trainer/rloo_trainer.py ### Reproduction None ### Expected behavior _No response_ ###...

enhancement
pending

I am using LLama-Factory to train long text DPO, but enabling unsloth is not supported with the latest version of the trl library. The newest trl update includes many useful...

fixed - pending confirmation

When attempting to merge three 14B LLMs on a custom task using the mergekit-evolve method, I ran into memory overflow issues on 8 A100 GPUs, each with 80G of memory....