Checking we use fused kernels to compute scaled masked softmax on prefix lm

Open thomasw21 opened this issue 4 years ago • 0 comments

Basically re-opening the PR as it seems to pass locally but not CI.

Nov 29 '21 13:11 thomasw21