sekh77

3 comments by sekh77

@youkaichao - Is this change now available in version 0.6.2? I have a requirement to load LLaMA 3.2 90B vision model across four GPUs spread across two nodes using pipeline...

@DarkLight1337 - Is PP supported for the Databricks DBRX model (databricks/dbrx-instruct)?

Command that I'm using to load the model: vllm serve meta-llama/Llama-3.2-90B-Vision-Instruct --enforce-eager --max-num-seqs 16 --tensor-parallel-size 4
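Note that the command above uses tensor parallelism only, while the earlier comment describes four GPUs spread across two nodes, which is the pipeline-parallel case. A hedged sketch of what that multi-node launch could look like, assuming 2 GPUs per node on 2 Ray-connected nodes (the flags shown are real vLLM options, but the node layout and head-node address are assumptions, not from this thread):

```shell
# Sketch only, not the exact command from this thread. Assumes a 2-node
# cluster with 2 GPUs per node (2 x 2 = 4 GPUs total).

# First, join both nodes into one Ray cluster (head-node IP is hypothetical):
#   node 0:  ray start --head --port=6379
#   node 1:  ray start --address=<head-node-ip>:6379

# Then launch once: pipeline parallelism splits layers across the two nodes,
# tensor parallelism splits each layer across the 2 GPUs within a node.
vllm serve meta-llama/Llama-3.2-90B-Vision-Instruct \
  --enforce-eager \
  --max-num-seqs 16 \
  --tensor-parallel-size 2 \
  --pipeline-parallel-size 2 \
  --distributed-executor-backend ray
```

The general rule is that tensor-parallel-size times pipeline-parallel-size must equal the total GPU count, with tensor parallelism kept within a node to avoid cross-node traffic on every layer.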