AQLM
`__CUDA_ARCH__<=8.0` compilation error fix.
bfloat16 is not supported on the T4 (Compute Capability 7.5) or on GPUs with the same or lower Compute Capability, so the bfloat16 kernels fail to compile for those targets.
This PR isolates the bfloat16 code behind a Compute Capability check and throws a runtime error if the kernel is not available.
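
As a rough illustration of the guard pattern described above (not the PR's actual code), the sketch below compiles the bfloat16 path only for architectures assumed to support it and raises a runtime error on the host otherwise. The names `scale_bf16`, `bf16_kernels_available`, and `require_bf16_kernels` are hypothetical, and the cutoff of Compute Capability 8.0 (`__CUDA_ARCH__ >= 800`) is an assumption based on where native bfloat16 arithmetic became available.

```cpp
#include <stdexcept>
#include <string>

#ifdef __CUDACC__
#include <cuda_bf16.h>

// Hypothetical kernel: the bfloat16 body is only compiled for CC >= 8.0.
// For older targets the body is empty, so compilation succeeds there too.
__global__ void scale_bf16(__nv_bfloat16* data, __nv_bfloat16 factor, int n) {
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 800
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i] = __hmul(data[i], factor);
    }
#endif
}
#endif  // __CUDACC__

// Hypothetical host-side check (assumption: bfloat16 kernels need CC >= 8.0).
bool bf16_kernels_available(int major, int minor) {
    (void)minor;  // only the major version matters for this cutoff
    return major >= 8;
}

// Throws instead of launching a kernel whose body was compiled out.
void require_bf16_kernels(int major, int minor) {
    if (!bf16_kernels_available(major, minor)) {
        throw std::runtime_error(
            "bfloat16 kernels require Compute Capability 8.0 or higher, got " +
            std::to_string(major) + "." + std::to_string(minor));
    }
}
```

In a real build, the `major` and `minor` values would typically come from `cudaGetDeviceProperties`, and `require_bf16_kernels` would be called before launching any bfloat16 kernel.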