LLaMA-Factory column names don't match, An error occurred while generating the dataset

column names don't match, An error occurred while generating the dataset

Open Xin-20 opened this issue 1 year ago • 8 comments

May I have some hint about how to solve this question pls：

The detail：I want to use the dataset format like this in json file： Then I just add the dataset info in the dataset_info.json like this： My file are set like this： -baichuan --baichuan-7B ---baichuan-7B --LLaMA-Efficient-Tuning ---data ----alpaca4zh.json The training command： CUDA_VISIBLE_DEVICES=0 python src/train_sft.py
--model_name_or_path /root/baichuan/baichuan-7B/baichuan-7B
--do_train
--dataset alpaca4zh
--finetuning_type lora
--lora_rank 8
--lora_target W_pack
--output_dir alpaca_baichuan
--per_device_train_batch_size 4
--per_device_eval_batch_size 4
--gradient_accumulation_steps 8
--lr_scheduler_type cosine
--logging_steps 10
--save_steps 100
--eval_steps 100
--learning_rate 5e-5
--max_grad_norm 0.5
--num_train_epochs 3.0
--dev_ratio 0.01
--evaluation_strategy steps
--load_best_model_at_end
--plot_loss
--fp16 The bug：