Oren de Lame

Results 1 issues of Oren de Lame

I'm trying to implement flash attention 4 in CuTile and got stuck on the polynomial exponent. Essentially flash attention 4 uses a polynomial approximation for exp2 in order to reduce...

status: triaged
feature request