local-inference topic

List local-inference repositories

PowerInfer

7.9k
Stars
406
Forks
73
Watchers

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

IALab-Suite

18
Stars
1
Forks
Watchers

Tool for test diferents large language models without code.

fiddler

165
Stars
16
Forks
Watchers

Fast Inference of MoE Models with CPU-GPU Orchestration