Open-Sora question about parallelism

question about parallelism

Open bluenevus opened this issue 1 year ago • 2 comments

I read that we can get command line by doing this. But how do we achieve parallelism so we can just make the api call and that happens with the number of gpus attached, like 2 or 4 gpus

To enable sequence parallelism, you need to use torchrun to run the inference script. The following command will run the inference with 2 GPUs.

I'm making the api call which works great, but watching, its only using one of the gpus despite having 2 attached

Jun 26 '24 00:06 bluenevus

What's your command @bluenevus ?

Jul 02 '24 23:07 JThh

This issue is stale because it has been open for 7 days with no activity.

Jul 10 '24 01:07 github-actions[bot]

This issue was closed because it has been inactive for 7 days since being marked as stale.

Jul 17 '24 01:07 github-actions[bot]

Open-Sora Open-Sora copied to clipboard

question about parallelism

Open-Sora
Open-Sora copied to clipboard