flashinfer icon indicating copy to clipboard operation
flashinfer copied to clipboard

Error when building flashinfer-jit-cache from source.

Open pavank63 opened this issue 2 months ago • 3 comments

Hi Team, we are trying to build the flashinfer-jit-cache from the source repository and the AOT compilation step currently fails with the following issue

ptxas /tmp/tmpxft_00018a8b_00000000-10_quantization.compute_120.ptx, line 15870; error   : Instruction 'cvt with .e2m1x2' not supported on .target 'sm_120'
ptxas fatal   : Ptx assembly aborted due to errors

I have FLASHINFER_CUDA_ARCH_LIST: "7.5 8.0 8.6 8.7 8.9 9.0a 10.0 10.0a 12.0" and the builds succeeds when i remove 10.0 and 12.0 from the list. Does that mean the sm_100 and sm_120 do not support .e2m1x2 ?

Are there any flags that I can use to skip just .e2m1x2 compilation and still do rest of AOT for sm_100 and sm_120 or should I just remove them from the FLASHINFER_CUDA_ARCH_LIST and jit compile for these two architectures ?

pavank63 avatar Oct 22 '25 14:10 pavank63

ping @yzh119

EmilienM avatar Oct 22 '25 15:10 EmilienM

Hi @pavank63 @EmilienM what's the cuda version you are using?

yzh119 avatar Oct 22 '25 16:10 yzh119

We are using CUDA 12.8.1

pavank63 avatar Oct 22 '25 16:10 pavank63