Oren de Lame
Results
1
issues of
Oren de Lame
I'm trying to implement flash attention 4 in CuTile and got stuck on the polynomial exponent. Essentially flash attention 4 uses a polynomial approximation for exp2 in order to reduce...
status: triaged
feature request