llm-inference topic

List llm-inference repositories

genv

473
Stars
22
Forks

GPU environment and cluster management with LLM support

bespoke_automata

213
Stars
25
Forks

Bespoke Automata is a GUI and deployment pipeline for making complex AI agents locally and offline

inferflow

235
Stars
24
Forks

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Exa

23
Stars
3
Forks

Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minimal learning curve.

aoororachain

32
Stars
0
Forks

Aoororachain is a Ruby chain tool for working with LLMs

py-txi

31
Stars
4
Forks

A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.

TruthX

114
Stars
5
Forks

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Morpheus

187
Stars
141
Forks

Morpheus - A Network For Powering Smart Agents - Compute + Code + Capital + Community

ray-educational-materials

339
Stars
63
Forks

This is a suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.

llms-in-prod-workshop-2023

26
Stars
3
Forks

Deploy and scale LLM-based applications