jan
jan copied to clipboard
epic: Jan <> NVIDIA Triton & TensorRT LLM
- [ ] Step by Step documentation for Jan <> Triton inference server with TensorRT-LLM - 1 day
- [ ] Updated code in
triton-tensorrt-llm
extension to support - 1 day