fastGPT
fastGPT copied to clipboard
Speedup ffn and gelu
On my machine these changes speedup inference from 0.789s to 0.602s.
With caching on, both main and this PR show 0.288s. With caching off, this PR is 0.543s, main is 0.716s.