llama2.c
llama2.c copied to clipboard
mfu calculation
Hi as far as I know
MFU = 6N + 12LHPQ
It is a kind of heustric value not strictlly accurate. I was just wondering is it more precise on the big language model case than the small toy number?