llm-inference topic
sagify
LLMs and machine learning made easy
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
runbooks
Fine-tune LLMs on Kubernetes using runbooks
lorax
Multi-LoRA inference server that scales to thousands of fine-tuned LLMs
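The reason one server can host thousands of fine-tuned variants is that LoRA adapters are tiny low-rank deltas over a shared base weight: the expensive base matmul is computed once, and only a cheap rank-r path differs per adapter. A minimal NumPy sketch of that idea (illustrative only; lorax's actual internals and API will differ):

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r = 8, 6, 2
W = rng.standard_normal((d_in, d_out))  # base weight, shared by all adapters

# Per-adapter low-rank factors A (d_in x r) and B (r x d_out); names are made up.
adapters = {
    "adapter_a": (rng.standard_normal((d_in, r)), rng.standard_normal((r, d_out))),
    "adapter_b": (rng.standard_normal((d_in, r)), rng.standard_normal((r, d_out))),
}

def forward(x, adapter_id, scale=1.0):
    """y = x @ W + scale * (x @ A) @ B  --  equivalent to x @ (W + scale * A @ B),
    but the base matmul stays shared and the per-adapter cost is only rank r."""
    A, B = adapters[adapter_id]
    return x @ W + scale * (x @ A) @ B

x = rng.standard_normal((1, d_in))
y = forward(x, "adapter_a")
```

Because the adapter factors are orders of magnitude smaller than W, swapping adapters per request is cheap, which is what makes multi-tenant LoRA serving practical.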
llm-api
Run any Large Language Model behind a unified API
local-llm-function-calling
A tool for generating function arguments and choosing what function to call with local LLMs
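The core trick behind reliable function calling with local models is to constrain the model's choice to a fixed registry of declared functions, so an invalid call is impossible by construction. A toy sketch of that constrained-selection idea, with a mock scoring function standing in for a local LLM's likelihoods (the registry format and all names here are hypothetical, not local-llm-function-calling's real API):

```python
# Toy function registry in an OpenAI-style tool format (illustrative only).
functions = [
    {"name": "get_weather",
     "parameters": {"type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"]}},
    {"name": "get_time",
     "parameters": {"type": "object", "properties": {}}},
]

def mock_llm_score(prompt, candidate):
    """Stand-in for a local LLM's likelihood of `candidate` given `prompt`:
    here, just count candidate words that appear in the prompt."""
    return sum(1 for word in candidate.split("_") if word in prompt.lower())

def choose_function(prompt):
    """Restrict the 'generation' to valid function names and pick the one
    the model scores highest -- the model can never call an unknown function."""
    return max(functions, key=lambda f: mock_llm_score(prompt, f["name"]))

fn = choose_function("What's the weather in Paris?")
```

A real implementation applies the same constraint during token-by-token decoding (and likewise constrains the generated arguments to the function's JSON schema), but the selection principle is the same.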
nos
⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
LLM-Minutes-of-Meeting
🎤📄 A tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings.
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
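Low-bit quantization shrinks LLM weights by storing them as small integers plus a per-tensor scale, trading a bounded rounding error for large memory and bandwidth savings. A minimal NumPy sketch of symmetric int4 quantization as a concept (not neural-speed's actual API, which uses optimized kernels and finer-grained grouping):

```python
import numpy as np

def quantize_int4(w):
    """Symmetric per-tensor quantization to the signed int4 range [-8, 7]."""
    scale = np.abs(w).max() / 7.0              # map the largest weight to +/-7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Map quantized integers back to approximate float weights."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25], dtype=np.float32)
q, scale = quantize_int4(w)
w_hat = dequantize(q, scale)                    # reconstruction, error <= scale/2
```

At inference time the integer weights are dequantized (or multiplied directly with integer kernels) on the fly, which is where the speedup over full-precision weights comes from.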
PalmHill.BlazorChat
PalmHill.BlazorChat is a chat application and API built with Blazor WebAssembly, SignalR, and WebAPI, featuring real-time LLM conversations, markdown support, customizable settings, and a responsive design.