Efeslab

Results 2 repositories owned by Efeslab
trafficstars

fiddler

165
Stars
16
Forks
Watchers

Fast Inference of MoE Models with CPU-GPU Orchestration

Nanoflow

522
Stars
18
Forks
Watchers

A throughput-oriented high-performance serving framework for LLMs