openai-triton topic

List openai-triton repositories

stable-fast

1.2k
Stars
70
Forks
Watchers

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

lightllm

2.3k
Stars
191
Forks
Watchers

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

attorch

523
Stars
28
Forks
Watchers

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.