llm-inference topic

List llm-inference repositories

OpenLLM

9.8k
Stars
626
Forks
49
Watchers

Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.

autogen

35.7k
Stars
5.2k
Forks
415
Watchers

A programming framework for agentic AI 🤖 (PyPI: autogen-agentchat)

mistral-inference

9.6k
Stars
846
Forks
103
Watchers

Official inference library for Mistral models

LLM.swift

332
Stars
34
Forks
Watchers

LLM.swift is a simple and readable library that lets you interact with large language models locally on macOS, iOS, watchOS, tvOS, and visionOS.

spatten

66
Stars
7
Forks
Watchers

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

friendli-client

40
Stars
8
Forks
Watchers

Friendli: the fastest serving engine for generative AI

llm-vscode-inference-server

52
Stars
8
Forks
Watchers

An endpoint server for efficiently serving quantized open-source LLMs for code.

tree-prompt

29
Stars
3
Forks
Watchers

Tree prompting: easy-to-use scikit-learn interface for improved prompting.

llm-sharp

39
Stars
6
Forks
Watchers

Language models in C#