RuohaoYan comments

Results 5 comments of


                                            RuohaoYan

Take a look at the code, it stopped working and the basic example of your code no longer works

regret

CUDA error: out of memory CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.

> CUDA_LAUNCH_BLOCKING=1 python python infer_finetuning.py upload the error log CUDA error: out of memory Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

CUDA error: out of memory CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.

> put all log , let me kown what happen. BloomConfig { "_name_or_path": "model/bloom-560m/config.json", "apply_residual_connection_post_layernorm": false, "architectures": [ "BloomForCausalLM" ], "attention_dropout": 0.0, "attention_softmax_in_fp32": true, "bias_dropout_fusion": true, "bos_token_id": 1, "eos_token_id": 2,...

CUDA error: out of memory CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.

> try use cpu load , it is just CUDA error: out of memory. Yes. But as I said above, I modified torch.load and it worked on both cpu and...

ModuleNotFoundError: No module named 'auto_gptq'

No, I give up it