
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)

Open forcekkk opened this issue 8 months ago • 0 comments

I want to use 8 RTX 4090 GPUs (24 GB each) to run W4A4 quantization on llama-7b, but the command below fails with this error.

python main.py \
--model ./llama-7b --epochs 1 --output_dir ./log/llama-7b-w4a4 --eval_ppl --wbits 4 --abits 4 --lwc --let --multigpu --tasks piqa,arc_easy,arc_challenge,boolq,hellaswag,winogrande

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)

forcekkk, Apr 11 '25 05:04
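For anyone hitting the same message: `index_select` is what an embedding lookup uses under the hood, so one common cause (an assumption here, not confirmed for this run) is that the token ids sit on one GPU while the embedding weights were placed on another. The sketch below is not OmniQuant code; `module_device` and `lookup_on_module_device` are hypothetical helper names that only illustrate the usual workaround of moving the indices to the device of the module that consumes them. It falls back to CPU when fewer than two GPUs are visible.

```python
import torch
import torch.nn as nn


def module_device(module: nn.Module) -> torch.device:
    """Return the device of the module's first parameter."""
    return next(module.parameters()).device


def lookup_on_module_device(embed: nn.Embedding, ids: torch.Tensor) -> torch.Tensor:
    """Move the indices to the embedding's device before the lookup."""
    return embed(ids.to(module_device(embed)))


# With two or more GPUs, place weights and indices on different devices to
# mimic the mismatch; otherwise everything stays on CPU.
multi_gpu = torch.cuda.device_count() >= 2
weight_dev = torch.device("cuda:1") if multi_gpu else torch.device("cpu")
ids_dev = torch.device("cuda:0") if multi_gpu else torch.device("cpu")

embed = nn.Embedding(10, 4).to(weight_dev)       # toy vocabulary of 10, dim 4
ids = torch.tensor([[1, 2, 3]], device=ids_dev)  # indices may live elsewhere

# Calling embed(ids) directly would raise the device-mismatch error on multi-GPU;
# the helper moves ids first, so the lookup succeeds.
out = lookup_on_module_device(embed, ids)
print(out.shape, out.device)
```

Printing `module_device(...)` for the layer named in the traceback and `.device` for its inputs is usually enough to see which tensor was left behind on `cuda:0`.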