llm-infernece topic

List llm-infernece repositories
trafficstars

LLM-TPU

254
Stars
44
Forks
254
Watchers

Run generative AI models in sophgo BM1684X/BM1688

picollm

273
Stars
15
Forks
273
Watchers

On-device LLM Inference Powered by X-Bit Quantization

llm-inference-benchmark

25
Stars
3
Forks
Watchers

LLM 推理服务性能测试