serving topic
sdk-python
Python library for Modzy Machine Learning Operations (MLOps) Platform
torchpipe
Serving Inside Pytorch
amazon-sagemaker-model-serving-using-aws-cdk
This repository provides AI/ML service(MachineLearning model serving) modernization solution using Amazon SageMaker, AWS CDK, and AWS Serverless services.
llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
genai-ko-LLM
This hands-on lab walks you through a step-by-step approach to efficiently serving and fine-tuning large-scale Korean models on AWS infrastructure.
lightgbm-serving
A lightweight server for LightGBM
ScaleLLM
A high-performance inference system for large language models, designed for production environments.
LoRA-deployment
LoRA fine-tuned Stable Diffusion Deployment
LitServe
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
happy_vllm
A REST API for vLLM, production ready