serving topic

List serving repositories
sdk-python

24
Stars
3
Forks
Watchers

Python library for the Modzy Machine Learning Operations (MLOps) platform

This repository provides an AI/ML service modernization solution (machine learning model serving) using Amazon SageMaker, AWS CDK, and AWS serverless services.

llm-applications

1.7k
Stars
220
Forks
Watchers

A comprehensive guide to building RAG-based LLM applications for production.

genai-ko-LLM

24
Stars
8
Forks
Watchers

This hands-on lab walks you through a step-by-step approach to efficiently serving and fine-tuning large-scale Korean models on AWS infrastructure.

lightgbm-serving

15
Stars
7
Forks
Watchers

A lightweight server for LightGBM

ScaleLLM

484
Stars
37
Forks
484
Watchers

A high-performance inference system for large language models, designed for production environments.

LitServe

1.9k
Stars
118
Forks
Watchers

Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.

happy_vllm

22
Stars
1
Forks
Watchers

A production-ready REST API for vLLM