Open-Sora

question about parallelism

Open bluenevus opened this issue 1 year ago • 2 comments

I read that we can get parallelism from the command line as described below. But how do we achieve parallelism when we just make the API call, so that it runs across however many GPUs are attached, e.g. 2 or 4?

To enable sequence parallelism, you need to use torchrun to run the inference script. The following command will run the inference with 2 GPUs.
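The command referred to in the quote is not reproduced in this post. As a rough sketch only, a multi-GPU launch through `torchrun` generally has the shape built below. The script and config names (`your_inference_script.py`, `your_config.py`) are hypothetical placeholders, not the project's actual paths, and the helper `build_torchrun_command` is made up for illustration. The only parts taken from `torchrun` itself are the `--standalone` and `--nproc_per_node` options.

```python
import shlex


def build_torchrun_command(num_gpus, script, script_args=()):
    """Build a single-node torchrun command that starts one process per GPU.

    `script` and `script_args` are placeholders supplied by the caller;
    nothing here is specific to any particular project layout.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        "torchrun",
        "--standalone",                    # single-node launch, no external rendezvous
        f"--nproc_per_node={num_gpus}",    # one worker process per GPU
        script,
        *script_args,
    ]


# Hypothetical file names, shown only to illustrate the command's shape.
cmd = build_torchrun_command(2, "your_inference_script.py", ["your_config.py"])
print(shlex.join(cmd))
```

The point of the sketch is that the process count is set at launch time by `torchrun`, so a program started as a single ordinary Python process would not be expected to spread across GPUs on its own.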

I'm making the API call, which works great, but watching GPU usage, it's only using one of the GPUs despite having 2 attached.

bluenevus • Jun 26 '24 00:06

What's your command, @bluenevus?

JThh • Jul 02 '24 23:07

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] • Jul 10 '24 01:07

This issue was closed because it has been inactive for 7 days since being marked as stale.

github-actions[bot] • Jul 17 '24 01:07