Murali Andoorveedu
Murali Andoorveedu
### 🚀 The feature, motivation and pitch # Make more operations inplace (GELU, BatchNorm, LayerNorm) ## **Summary** Hi PyTorch team, We would like to enable users to make the following...
This PR adds `send_object_list` and `recv_object_list` to `distributed_c10d.py`. This is extending functionality already present in PyTorch with `broadcast_object_list` that I noticed was missing and decided to upstream. With this change,...
Adds initial pipeline parallelism support to vLLM. ToDo: Milestone 1: POC Prototype - [x] Make changes to support multiple schedulers and cache engines in `worker.py`, `llm_engine.py`, `async_llm_engine.py` and block managers....
This adds docs for pipeline parallel. cc: @simon-mo @youkaichao @njhill
Adding tests for V1 multimodal abort (as requested by @WoosukKwon) as well as load.