su-heyang

Results 2 issues of su-heyang

train_sft.py训练指令: CUDA_VISIBLE_DEVICES=0 python src/train_sft.py \ --model_name_or_path /data1/projects/baichuan-7B/ \ --do_train \ --dataset alpaca_gpt4_zh \ --finetuning_type lora \ --output_dir output \ --overwrite_cache \ --per_device_train_batch_size 4 \ --gradient_accumulation_steps 4 \ --lr_scheduler_type cosine \...

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior 我使用本地模型路径,但仍然去.cache/huggingface/目录里去找模型文件,从而找不到模型文件 ### Expected Behavior _No response_ ### Steps To Reproduce run...