FlexGen
FlexGen copied to clipboard
CPU and M1/M2 GPU platform support
Reopen https://github.com/FMInference/FlexGen/pull/71 which was closed by mistake. Minimal modification to extend FlexGen to CPU and M1/M2 GPU platforms.