tvm icon indicating copy to clipboard operation
tvm copied to clipboard

Update scan.py to fix pascal error

Open rankaiyx opened this issue 8 months ago • 2 comments

https://github.com/mlc-ai/mlc-llm/issues/3231

When I use mlc-llm, I encounter error in linking. I solved the problem with this patch. This patch may not solve the root cause. But for my scene it is suitable, the performance loss is at the noise level. The root cause may be the lack of relevant instructions in Pascal, or a bug in Nvidia's Thrust library. This patch is intended as an emergency mitigation. Looking forward to a better way.

rankaiyx avatar Jul 03 '25 07:07 rankaiyx

When using pascal, can_use_thrust(target, "tvm.contrib.thrust.sum_scan") returns True, but will actually fail.

rankaiyx avatar Jul 04 '25 01:07 rankaiyx

Could you include sm_61 as well. NVIDIA Tesla P40 has the same issue

gatepoet avatar Aug 25 '25 10:08 gatepoet