llama2.py
llama2.py copied to clipboard
Is there any CUDA implementation?
This program runs in the CPU. Is there a pure python version that runs in the GPU?