Support the warp-level primitives for `f64` and `f16`

Open chaoming0625 opened this issue 2 years ago • 0 comments

Currently, we only see support for warp-level primitives on 32-bit data, such as `ti.simt.warp.shfl_sync_f32`, `ti.simt.warp.shfl_up_f32`, and `ti.simt.warp.shfl_down_f32`. These primitives cannot be applied to `f64` values. We would like to see equivalent primitives for float64.
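To illustrate what is being asked for, here is a minimal host-side sketch of one common way a 64-bit shuffle can be built from 32-bit ones: reinterpret each `f64` as two 32-bit words, shuffle each word separately, and reassemble. This is plain Python that models a warp as a list. It is not Taichi code and does not call any `ti.simt` function. All helper names (`split_f64`, `join_f64`, `shfl_down_u32`, `shfl_down_f64`) are hypothetical, and the rule that out-of-range lanes keep their own value is an assumption modeled on CUDA's shuffle-down behavior.

```python
import struct

WARP_SIZE = 32  # assumed warp width for this model


def split_f64(x):
    """Reinterpret a float64 as two unsigned 32-bit words (lo, hi)."""
    bits = struct.unpack("<Q", struct.pack("<d", x))[0]
    return bits & 0xFFFFFFFF, bits >> 32


def join_f64(lo, hi):
    """Reassemble a float64 from its low and high 32-bit words."""
    bits = (hi << 32) | lo
    return struct.unpack("<d", struct.pack("<Q", bits))[0]


def shfl_down_u32(lanes, offset):
    """Hypothetical model of a 32-bit shuffle-down over one warp.

    Lane i reads the value held by lane i + offset. Lanes whose source
    would fall outside the warp keep their own value (assumption).
    """
    n = len(lanes)
    return [lanes[i + offset] if i + offset < n else lanes[i] for i in range(n)]


def shfl_down_f64(values, offset):
    """Shuffle float64 values by shuffling their two 32-bit halves."""
    halves = [split_f64(v) for v in values]
    lo = shfl_down_u32([h[0] for h in halves], offset)
    hi = shfl_down_u32([h[1] for h in halves], offset)
    return [join_f64(l, h) for l, h in zip(lo, hi)]


# One float64 per lane of the modeled warp.
warp_values = [i + 0.5 for i in range(WARP_SIZE)]
shifted = shfl_down_f64(warp_values, 1)
```

The same split-and-recombine idea might work as a user-side workaround if a bit-level reinterpretation between `f64` and a pair of 32-bit values is available inside a kernel, but a built-in `f64` primitive would avoid that boilerplate.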

Dec 26 '23 03:12 chaoming0625