ggml topic

List ggml repositories
trafficstars

llm

6.1k
Stars
355
Forks
Watchers

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

mpt-30B-inference

575
Stars
94
Forks
Watchers

Run inference on MPT-30B using CPU

inference

8.8k
Stars
765
Forks
8.8k
Watchers

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready...

ggml-js

39
Stars
1
Forks
Watchers

JavaScript bindings for the ggml-js library

minigpt4.cpp

556
Stars
28
Forks
Watchers

Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)

clip.cpp

435
Stars
29
Forks
Watchers

CLIP inference in plain C/C++ with no extra dependencies

gpu_poor

781
Stars
38
Forks
Watchers

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

ialacol

142
Stars
17
Forks
Watchers

🪶 Lightweight OpenAI drop-in replacement for Kubernetes

LLaMA-Cult-and-More

446
Stars
24
Forks
Watchers

Large Language Models for All, 🦙 Cult and More, Stay in touch !

py.gpt.prompt

29
Stars
5
Forks
Watchers

PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-term memory and task automation.