model-inference-service topic

List model-inference-service repositories

BentoML

Stars: 8.3k, Forks: 891, Watchers: 8.3k

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

transformers-nlp-service

Stars: 43, Forks: 3

Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more

CLIP-API-service

Stars: 48, Forks: 3

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

Efficiently-Serving-LLMs

Stars: 17, Forks: 4, Watchers: 17

Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Pred...