turboderp
Results: 1 repository owned by turboderp
exllama
1.0k Stars
136 Forks
Watchers
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.