triton
triton copied to clipboard
tl.dot error when tile sizes < 16
tl.dot seems to not support when accumulator tile size < 16 (https://github.com/openai/triton/blob/854677046383bb3f0a30f3b2ba981b91fb9fb29f/python/triton/language/semantic.py#L1355C47-L1357C124)
May I know what's the reason?
See https://github.com/openai/triton/issues/3212
I'm looking into it but I don't have much experience with Triton, so this may take a while