fan
Results
5
comments of
fan
trafficstars
一样的问题
same 2*A100 26B
出现一样问题,同样必须执行TM_DEBUG_LEVEL=DEBUG,模型才能运行。不然多卡出现一张卡占用100%锁死问题。
> @josephrocca @fanghostt > > Can you reproduce it with other models? I can't reproduce it with Qwen2-7B-AWQ or Llama3-70B-AWQ with v0.6.0 on 2 RTX 4090 GPUs. same problem has...