WENG JIA HONG
Results
1
issues of
WENG JIA HONG
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction 請問一下問甚麼我在做二次預訓練的時候中途loss值為突然性的上升,請問怎麼回事呢 從一開始的2.多跑到5.多 ### Expected behavior CUDA_VISIBLE_DEVICES=0 python src/train_bash.py --stage pt --do_train True --model_name_or_path meta-llama/Meta-Llama-3-8B...