TensorRT-LLM
TensorRT-LLM copied to clipboard
Question about Orchestrator mode
Executor api introduces Leader and Orchestrator modes.
Leader works via mpi. How Orchestrator mode is implemented? Does it uses mpi itself? Which mode is preferable for performance: Leader or Orchestrator?