llama2.c icon indicating copy to clipboard operation
llama2.c copied to clipboard

Mojo version?

Open MackNcD opened this issue 2 years ago • 2 comments

https://github.com/tairov/llama2.mojo

Imagine 250x speed on the original...

MackNcD avatar Sep 12 '23 22:09 MackNcD

Imagine 250x speed on the original...

Probably you've noticed, this original means llama2.py, not llama2.c, I am interested in Mojo, though it requires minimum 8GiB RAM only for SDK.

espio999 avatar Sep 12 '23 23:09 espio999

The C version is faster when using multi-threading :fire:

performance comparison

kroggen avatar Sep 13 '23 18:09 kroggen