fastGPT icon indicating copy to clipboard operation
fastGPT copied to clipboard

Speedup ffn and gelu

Open certik opened this issue 2 years ago • 1 comments

On my machine these changes speedup inference from 0.789s to 0.602s.

certik avatar Mar 08 '23 00:03 certik

With caching on, both main and this PR show 0.288s. With caching off, this PR is 0.543s, main is 0.716s.

certik avatar Mar 17 '23 15:03 certik