About multi cards inference.

Open HandsomeWuuu opened this issue 1 year ago • 0 comments

the code of "inference/infer_single.py" can not infer "TRACE-Benchmark/LLM-CL-Benchmark_1000", due to CUDA OOM(A100 8G) and only can work on the single card. it's not reasonable.

I think the code of "inference/infer_multi.py" maybe for multi-card, but it have some bug for infer lora, after i fixed it, the infer results of every task is ""(nothing). maybe the "def prediction(model, infer_dataloader):" have some trouble.

can you check it? thanks for your work and help.

May 21 '24 02:05 HandsomeWuuu