GPUStack

Results 4 repositories owned by GPUStack

gguf-parser-go

24
Stars
2
Forks
Watchers

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

gpustack

4.1k
Stars
412
Forks
4.1k
Watchers

GPU cluster manager for optimized AI model deployment

llama-box

293
Stars
29
Forks
293
Watchers

LM inference server implementation based on *.cpp.

vox-box

179
Stars
26
Forks
179
Watchers

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.