multi-gpu-inference topic

List multi-gpu-inference repositories

inferflow

235
Stars
24
Forks
Watchers

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

DistRL-LLM

21
Stars
1
Forks
21
Watchers

Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization