TensorRT-LLM
TensorRT-LLM copied to clipboard
[Feature]: AutoDeploy: spec dec TP > 1 tests
🚀 The feature, motivation and pitch
Test 2-model (and later 1-model) spec dec with TP > 1. Maybe this test here can be extended: https://github.com/NVIDIA/TensorRT-LLM/pull/9275/files#r2557977057
Alternatives
No response
Additional context
No response
Before submitting a new issue...
- [x] Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.