Rahul C

Results 2 repositories owned by Rahul C

gpu_poor

781
Stars
38
Forks
Watchers

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

llama2.c-for-dummies

207
Stars
18
Forks
Watchers

Step by step explanation/tutorial of llama2.c