QI JUN

Results 22 issues of QI JUN

# What does this PR do? Add a one-line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

### Motivation Currently, we have a pure-Python [PyExecutor](https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/_torch/pyexecutor/py_executor.py) class, which handles the main event loop. It provides good flexibility to support features like the overlap scheduler and attention data...

RFC