RuohaoYan
RuohaoYan
regret
> CUDA_LAUNCH_BLOCKING=1 python python infer_finetuning.py upload the error log CUDA error: out of memory Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
> put all log , let me kown what happen. BloomConfig { "_name_or_path": "model/bloom-560m/config.json", "apply_residual_connection_post_layernorm": false, "architectures": [ "BloomForCausalLM" ], "attention_dropout": 0.0, "attention_softmax_in_fp32": true, "bias_dropout_fusion": true, "bos_token_id": 1, "eos_token_id": 2,...
> try use cpu load , it is just CUDA error: out of memory. Yes. But as I said above, I modified torch.load and it worked on both cpu and...
No, I give up it