turboderp
Results
1
repositories owned by
turboderp
exllama
1.0k
Stars
136
Forks
22
Watchers
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.