Multi GPU scheduling issue

Open Rytiss opened this issue 2 years ago • 5 comments

Describe the bug

Plan_class_spec allows specifying ngpus>1. However, everything on the BOINC client side assumes ngpus<=1: app_init_data.gpu_device_num is an int, the wrapper's gpu_device_num is an int, etc.

This makes it impossible to run a multi-GPU application on a chosen subset of devices. The only workaround is for the application to ignore the --device flag/aid.gpu_device_num in order to reach more than one GPU, in which case it uses all GPUs.

Expected behavior

gpu_device_num should be an array of devices in a multi-GPU context.
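To illustrate the shape of the requested change, here is a minimal sketch in Python (chosen for brevity; the BOINC client and API are C++). Everything in it is hypothetical: `AppInitDataSketch`, `gpu_device_nums`, the `-1` "no GPU" sentinel, and the idea of passing one `--device` flag per assigned GPU are assumptions made for illustration, not BOINC's actual struct or command-line convention.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AppInitDataSketch:
    """Hypothetical stand-in for the GPU part of app_init_data (not BOINC's real struct)."""

    # Assumed new field: every device assigned to the job, instead of a single int.
    gpu_device_nums: List[int] = field(default_factory=list)

    @property
    def gpu_device_num(self) -> int:
        # Backward-compatible single-device view: the first assigned device,
        # or -1 (an assumed sentinel) when no GPU is assigned.
        return self.gpu_device_nums[0] if self.gpu_device_nums else -1


def device_args(aid: AppInitDataSketch) -> List[str]:
    """Build a hypothetical command line with one --device flag per assigned GPU."""
    args: List[str] = []
    for dev in aid.gpu_device_nums:
        args += ["--device", str(dev)]
    return args
```

With something along these lines, a job assigned devices 0 and 2 would see both of them, while single-GPU code that only reads `gpu_device_num` would keep working against the first one.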

Rytiss avatar Mar 10 '23 13:03 Rytiss

@davidpanderson, could you please take a look?

AenBleidd avatar Mar 12 '23 17:03 AenBleidd

This would be a major change, and I haven't seen any multi-GPU apps.

davidpanderson avatar Mar 13 '23 01:03 davidpanderson

Would love multi-GPU support. These days any machine with a GPU in a PCIe slot probably also has an onboard GPU. Multi-GPU support would also be important for AI projects that want to use BOINC in some way. Maybe not top development priority but sure would be cool :)

makeasnek avatar Mar 19 '23 22:03 makeasnek

I'm not sure any application could use two different GPUs (especially from different vendors) at the same time. On the other hand, not many people have two or more identical GPUs installed in the same machine. So I don't really think any application would benefit much from such a configuration. This ticket is therefore mostly about fixing the inconsistency between client and server, and definitely not about adding multi-GPU support.

AenBleidd avatar Mar 19 '23 23:03 AenBleidd

Count me in as a multi-GPU user on 5 hosts, all with the same cards. Same for my teammates. Oh, and also for another hundred or so BOINC volunteers who are contesters, as in the BOINC Pentathlon running now.

KeithMyers avatar May 16 '23 20:05 KeithMyers