llm-inference topic

List llm-inference repositories

sagify

434 Stars, 69 Forks

LLMs and Machine Learning done easily

GenerativeAIExamples

2.2k Stars, 428 Forks, 36 Watchers

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

runbooks

168 Stars, 14 Forks

Fine-tune LLMs on K8s by using Runbooks

lorax

2.1k Stars, 139 Forks

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

llm-api

158 Stars, 25 Forks

Run any Large Language Model behind a unified API

local-llm-function-calling

331 Stars, 31 Forks

A tool for generating function arguments and choosing what function to call with local LLMs

nos

126 Stars, 10 Forks

⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.

LLM-Minutes-of-Meeting

92 Stars, 12 Forks

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...

neural-speed

346 Stars, 38 Forks

An innovative library for efficient LLM inference via low-bit quantization

PalmHill.BlazorChat

39 Stars, 9 Forks

PalmHill.BlazorChat is a chat application and API built with Blazor WebAssembly, SignalR, and WebAPI, featuring real-time LLM conversations, markdown support, customizable settings, and a responsive d...