Implement Sapphire Rapids features

Open jwlawson opened this issue 4 years ago • 3 comments

It would be good to have support for the additional instruction set extensions added with Sapphire Rapids, including AVX512 VNNI, AVX512 BF16 and AMX.

  • [x] Add Sapphire Rapids target feature (#5677)
    • [x] Extend CPUID support in src/runtime to allow querying cpuid(eax=7, ecx=1) (#5684, #5702)
  • [x] Add support for avx512 bf16 (see Intrinsics guide)
    • [x] Add support for vcvtneps2bf16 (#5677)
    • [x] Provide wrapper for 128bit version of vcvtneps2bf16, as LLVM does not provide an unmasked version of the intrinsic (#5704)
    • [x] Add support for vcvtne2ps2bf16 (#5711)
    • [x] Add support for bf16 dot product (#5712)
  • [ ] Add support for avx512 VNNI (see Intrinsics guide)
  • [ ] Add support for AMX matrix extensions (see Intrinsics guide)
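
For reference, the feature bits behind the checklist above all live in CPUID leaf 7. The following is a minimal sketch of how they could be decoded, not Halide's actual runtime code: `SapphireRapidsFeatures`, `CpuidRegs`, `decode_leaf7` and `query_host` are hypothetical names chosen for illustration, and the host query assumes a GCC or Clang toolchain that ships `<cpuid.h>`. The bit positions follow Intel's documented CPUID enumeration. The sketch only reads CPUID; a real detector would also need to confirm that the OS has enabled the AVX-512 and AMX register state.

```cpp
#if (defined(__x86_64__) || defined(__i386__)) && defined(__GNUC__)
#include <cpuid.h>  // GCC/Clang helper providing __get_cpuid_count
#define HAVE_CPUID_H 1
#endif

// Hypothetical container for the features discussed in this issue.
struct SapphireRapidsFeatures {
    bool avx512_vnni;  // leaf 7, subleaf 0, ECX bit 11
    bool avx512_fp16;  // leaf 7, subleaf 0, EDX bit 23
    bool amx_bf16;     // leaf 7, subleaf 0, EDX bit 22
    bool amx_tile;     // leaf 7, subleaf 0, EDX bit 24
    bool amx_int8;     // leaf 7, subleaf 0, EDX bit 25
    bool avx_vnni;     // leaf 7, subleaf 1, EAX bit 4
    bool avx512_bf16;  // leaf 7, subleaf 1, EAX bit 5
};

// Raw register values returned by one CPUID invocation.
struct CpuidRegs {
    unsigned int eax, ebx, ecx, edx;
};

inline bool test_bit(unsigned int reg, int n) {
    return ((reg >> n) & 1u) != 0;
}

// Pure decoding step: takes the registers for cpuid(eax=7, ecx=0) and
// cpuid(eax=7, ecx=1) and extracts the relevant feature bits.
inline SapphireRapidsFeatures decode_leaf7(const CpuidRegs &sub0,
                                           const CpuidRegs &sub1) {
    SapphireRapidsFeatures f{};
    f.avx512_vnni = test_bit(sub0.ecx, 11);
    f.avx512_fp16 = test_bit(sub0.edx, 23);
    f.amx_bf16 = test_bit(sub0.edx, 22);
    f.amx_tile = test_bit(sub0.edx, 24);
    f.amx_int8 = test_bit(sub0.edx, 25);
    f.avx_vnni = test_bit(sub1.eax, 4);
    f.avx512_bf16 = test_bit(sub1.eax, 5);
    return f;
}

// Queries the host CPU. Returns false when CPUID leaf 7 is unavailable
// (or when not building for x86 with GCC/Clang).
inline bool query_host(SapphireRapidsFeatures *out) {
#ifdef HAVE_CPUID_H
    CpuidRegs sub0{}, sub1{};
    if (!__get_cpuid_count(7, 0, &sub0.eax, &sub0.ebx, &sub0.ecx, &sub0.edx)) {
        return false;
    }
    // For subleaf 0, EAX reports the highest supported subleaf of leaf 7.
    if (sub0.eax >= 1) {
        __get_cpuid_count(7, 1, &sub1.eax, &sub1.ebx, &sub1.ecx, &sub1.edx);
    }
    *out = decode_leaf7(sub0, sub1);
    return true;
#else
    (void)out;
    return false;
#endif
}
```

Keeping the decoding separate from the CPUID call means the bit layout can be checked with fixed register values on any platform, while `query_host` stays a thin wrapper around the instruction itself.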

jwlawson Feb 01 '21 13:02

Whoops, didn't mean to edit; I was just trying the checkboxes...

dsharletg Feb 01 '21 18:02

Sapphire Rapids also supports AVX512-FP16, but I noticed that it’s not included in the detected feature list. Is there a specific reason for this exclusion?

changhoon-sung Mar 14 '25 18:03

There is no specific reason. It's just not yet implemented. PRs welcome.

abadams Mar 14 '25 18:03