triton-inference-server topic
BiSeNet
My implementation of BiSeNet, with BiSeNetV2 added.
yolov4-triton-tensorrt
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
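As a rough illustration of what querying such a deployment looks like, the sketch below sends a dummy image to a Triton-served YOLOv4 TensorRT model using the official Python HTTP client. The model name, tensor names, and 608x608 input shape are placeholders, not values taken from the repository; the real ones come from the engine and its config.pbtxt.

```python
# Minimal sketch (not taken from the repository) of querying a YOLOv4
# TensorRT model served by Triton with the official Python HTTP client.
# "yolov4", "input", "detections", and the input shape are placeholders.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Dummy preprocessed batch: NCHW float32 (layout assumed for illustration).
image = np.random.rand(1, 3, 608, 608).astype(np.float32)

inputs = [httpclient.InferInput("input", list(image.shape), "FP32")]
inputs[0].set_data_from_numpy(image)
outputs = [httpclient.InferRequestedOutput("detections")]

result = client.infer(model_name="yolov4", inputs=inputs, outputs=outputs)
print(result.as_numpy("detections").shape)
```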
onnxruntime_backend
The Triton backend for the ONNX Runtime.
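For context, a model opts into this backend through its config.pbtxt. The sketch below lays out a minimal model repository entry that selects `backend: "onnxruntime"`; the model name, tensor names, and shapes are illustrative placeholders.

```python
# Sketch of a minimal Triton model repository entry for the ONNX Runtime
# backend. The layout and config.pbtxt fields follow Triton's model
# repository conventions; "my_onnx_model" and the tensor names/shapes
# are placeholders.
from pathlib import Path

repo = Path("model_repository/my_onnx_model")
(repo / "1").mkdir(parents=True, exist_ok=True)  # version directory holds model.onnx

config = """
name: "my_onnx_model"
backend: "onnxruntime"
max_batch_size: 8
input [
  {
    name: "input"
    data_type: TYPE_FP32
    dims: [ 3, 224, 224 ]
  }
]
output [
  {
    name: "output"
    data_type: TYPE_FP32
    dims: [ 1000 ]
  }
]
"""
(repo / "config.pbtxt").write_text(config.strip() + "\n")
# Place the exported model at model_repository/my_onnx_model/1/model.onnx, then run:
#   tritonserver --model-repository=model_repository
```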
fastDeploy
Deploy DL/ML inference pipelines with minimal extra code.
stable-diffusion-tritonserver
Deploy the Stable Diffusion model with ONNX/TensorRT + Triton Inference Server
clearml-serving
ClearML - Model-Serving Orchestration and Repository Solution
triton_ensemble_model_demo
Triton Inference Server ensemble model demo; a sketch of an ensemble definition follows below.
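The sketch below shows what an ensemble config.pbtxt can look like, chaining a preprocessing model into a classifier. All model and tensor names are placeholders, not the ones used in this demo.

```python
# Sketch of an ensemble config.pbtxt that chains a preprocessing model into a
# classifier, in the spirit of a Triton ensemble demo. "preprocess",
# "classifier", RAW_IMAGE, SCORES, etc. are placeholders.
ensemble_config = """
name: "pipeline"
platform: "ensemble"
max_batch_size: 8
input [
  {
    name: "RAW_IMAGE"
    data_type: TYPE_UINT8
    dims: [ -1 ]
  }
]
output [
  {
    name: "SCORES"
    data_type: TYPE_FP32
    dims: [ 1000 ]
  }
]
ensemble_scheduling {
  step [
    {
      model_name: "preprocess"
      model_version: -1
      input_map { key: "INPUT" value: "RAW_IMAGE" }
      output_map { key: "OUTPUT" value: "preprocessed_image" }
    },
    {
      model_name: "classifier"
      model_version: -1
      input_map { key: "input" value: "preprocessed_image" }
      output_map { key: "output" value: "SCORES" }
    }
  ]
}
"""
# Save as model_repository/pipeline/config.pbtxt next to the member models.
```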
Setup-deeplearning-tools
Set up CI and DL tooling (CUDA, cuDNN, TensorRT, onnx2trt, onnxruntime, onnxsim, PyTorch, Triton Inference Server, Bazel, Tesseract, PaddleOCR, NVIDIA Docker, MinIO, Supervisord) on an AGX or PC from scratch.
isaac_ros_dnn_inference
ROS 2 packages for hardware-accelerated DNN model inference using NVIDIA Triton/TensorRT, for both Jetson and x86_64 systems with a CUDA-capable GPU
YOLOV5_optimization_on_triton
Compare multiple optimization methods on Triton to improve model-serving performance; one simple comparison approach is sketched below.
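As a rough sketch of how such a comparison can be run, the snippet below sends identical requests to two deployed variants of the same model and reports mean latency. The model and tensor names are placeholders, and Triton's perf_analyzer tool is the more thorough option for this kind of measurement.

```python
# Rough sketch: compare serving variants of the same model by sending identical
# requests to each deployed configuration and measuring mean latency.
# "yolov5_onnx", "yolov5_trt_fp16", and "images" are placeholder names.
import time
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")
batch = np.random.rand(1, 3, 640, 640).astype(np.float32)

def mean_latency_ms(model_name, iterations=50):
    inp = httpclient.InferInput("images", list(batch.shape), "FP32")
    inp.set_data_from_numpy(batch)
    start = time.perf_counter()
    for _ in range(iterations):
        client.infer(model_name=model_name, inputs=[inp])
    return (time.perf_counter() - start) / iterations * 1000.0

for name in ("yolov5_onnx", "yolov5_trt_fp16"):
    print(f"{name}: {mean_latency_ms(name):.1f} ms/request")
```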