
Unify Efficient Fine-Tuning of 100+ LLMs

Results: 548 LLaMA-Factory issues

![2023-06-16 19-10-21 screenshot](https://github.com/hiyouga/LLaMA-Efficient-Tuning/assets/44988584/338bea40-53d4-42b2-afaa-62fc974844f7) What could be causing this problem?

pending

Hello author, I have a few questions about continual pretraining with baichuan-7B:

```
CUDA_VISIBLE_DEVICES=0 python src/train_pt.py \
    --model_name_or_path path_to_your_model \
    --do_train \
    --dataset wiki_demo \
    --finetuning_type lora \
    --output_dir path_to_pt_checkpoint \
    --overwrite_cache \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine...
```

pending

May I have a hint on how to solve this problem, please: ![image](https://github.com/hiyouga/LLaMA-Efficient-Tuning/assets/71247642/e989b5eb-6afa-4f64-a51f-e9f1e32ef500) In detail: I want to use a dataset formatted like this in a JSON file: ![image](https://github.com/hiyouga/LLaMA-Efficient-Tuning/assets/71247642/bf718624-16ac-44b3-9709-48c96bbde928) Then I just...

pending

As the title says: I tried the baichuan-7b-sft model open-sourced yesterday, and it works quite well. May I ask what conversation data was used during training? Would you be willing to make it public? Thanks!

pending

```
│ 771 │   if self.distributed_type == DistributedType.DEEPSPEED:
│ ❱ 772 │       config = self.deepspeed_plugin.deepspeed_config
│ 773 │       if config.get("fp16", {}).get("enabled", False):
...
```

pending

This codebase covers the pretraining and RLHF pipelines, plus the LoRA and QLoRA techniques. It is really comprehensive. But it would be even better if multi-turn dialogue construction were supported: for example, given [q1,a1,q2,a2,q3,a3], build the training sample as prompt: q1*[IGNORE_INDEX]+a1+q2*[IGNORE_INDEX]+a2+q3*[IGNORE_INDEX], response: a3.
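The scheme in the request above can be sketched in a few lines. This is a minimal illustration, not LLaMA-Factory's actual code: all prompt tokens (the history turns) receive the label `IGNORE_INDEX`, so the loss is computed only on the final answer, matching the "response: a3" part of the request. `build_multi_turn_sample` and its list-of-token-ids input format are hypothetical names chosen for the example.

```python
IGNORE_INDEX = -100  # the label value ignored by PyTorch's CrossEntropyLoss

def build_multi_turn_sample(turn_token_ids):
    """turn_token_ids: [q1, a1, q2, a2, ..., qn, an], each a list of token ids.

    Returns (input_ids, labels) where every history token is masked with
    IGNORE_INDEX and only the final answer an is supervised.
    """
    input_ids, labels = [], []
    *history, final_answer = turn_token_ids
    for ids in history:                 # history turns are context only
        input_ids.extend(ids)
        labels.extend([IGNORE_INDEX] * len(ids))
    input_ids.extend(final_answer)      # loss is taken on the last answer
    labels.extend(final_answer)
    return input_ids, labels
```

A variant often used in practice masks only the questions and supervises every answer, which yields more training signal per conversation; the sketch above follows the single-response form described in the request.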

enhancement
solved

Error:

```
Assistant: Traceback (most recent call last):
Traceback (most recent call last):
  File "/tmp/cct/src/train_ppo.py", line 82, in
Traceback (most recent call last):
Traceback (most recent call last):
  File "/tmp/cct/src/train_ppo.py",...
```

pending

```
command_file: null
commands: null
compute_environment: LOCAL_MACHINE
deepspeed_config:
  gradient_accumulation_steps: 1
  gradient_clipping: 1.0
  offload_optimizer_device: none
  offload_param_device: none
  zero3_init_flag: true
  zero3_save_16bit_model: true
  zero_stage: 3
distributed_type: DEEPSPEED
downcast_bf16: 'no'
dynamo_backend: 'NO'
fsdp_config: {}
gpu_ids:...
```
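One thing worth noting about the config above: it enables DeepSpeed ZeRO-3 but sets no mixed-precision mode, so Accelerate's check of the `fp16` section of the generated DeepSpeed config finds nothing enabled. A hedged sketch (field names taken from Accelerate's config format; values here are illustrative, not a recommendation) of the same config with fp16 set explicitly:

```
compute_environment: LOCAL_MACHINE
distributed_type: DEEPSPEED
mixed_precision: fp16        # Accelerate maps this onto the DeepSpeed fp16 section
deepspeed_config:
  zero_stage: 3
  gradient_clipping: 1.0
```

Whether `mixed_precision: fp16` is the right fix depends on the model and hardware; on Ampere or newer GPUs, `bf16` is a common alternative.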

pending