llm-infernece topic

List llm-infernece repositories

LLM-TPU

115
Stars
19
Forks
Watchers

Run generative AI models in sophgo BM1684X

picollm

162
Stars
6
Forks
Watchers

On-device LLM Inference Powered by X-Bit Quantization

llm-inference-benchmark

25
Stars
3
Forks
Watchers

LLM 推理服务性能测试