cly2625

Results 4 comments of cly2625

![Snipaste_2024-08-15_11-19-45](https://github.com/user-attachments/assets/8a7e769f-8281-4fe5-9ed8-d5ed406d13d2) 我加上参数了,但是出现了这种情况 **stf.yaml:** args: checkpoint_activations: True ## using gradient checkpointing model_parallel_size: 1 experiment_name: lora-disney mode: finetune load: "CogVideoX-2b-sat/transformer" no_load_rng: True train_iters: 200 eval_iters: 1 eval_interval: 100 eval_batch_size: 1 save: ckpts...

> Have you seen your loss when fine-tuning? May it is Nan? The loss is acceptable. The issue is occurring in `sat/SwissArmyTransformer/sat/training/model_io.py` at Line 224: `model._save_checkpoint(save_dir, tag, client_state=client_state, exclude_frozen_parameters=True)`. I...