model-inference topic

List model-inference repositories

BentoML

8.3k
Stars
891
Forks
8.3k
Watchers

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

OpenLLM

9.8k
Stars
626
Forks
Watchers

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Awesome-EdgeAI

49
Stars
5
Forks
Watchers

Resources of our survey paper "A Systematic Review of AI Deployment on Resource-Constrained Edge Devices: Challenges, Techniques, and Applications"

CLIP-API-service

48
Stars
3
Forks
Watchers

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

Image_captioning

17
Stars
0
Forks
Watchers

Генерация описаний к изображениям с помощью различных архитектур нейронных сетей

edge-tpu-silva

23
Stars
3
Forks
Watchers

Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.

embeddedllm

43
Stars
1
Forks
43
Watchers

EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU