mediapipe
mediapipe copied to clipboard
Is Gemma on device really this slow ?
I used llm_inference sample with gemma-2b-it-cpu-int4.bin
on Pixel 8 Pro emulator.
The prefill speed seems to be in minutes.
Pixel 8 Pro configurations:- RAM - 22GB, VM heap - 512mb
Reference video https://github.com/googlesamples/mediapipe/assets/22965002/c7730dba-48e8-4eec-ae68-fe847d2778f2