FedCV icon indicating copy to clipboard operation
FedCV copied to clipboard

Problems of distributed computing in federated learning

Open rG223 opened this issue 3 years ago • 1 comments

When using distributed operation, I have four Gpus, each of which has a client. During the training process, each GPU has a huge difference. Two gpus even ran out of memory. By the way, I also found that gpu training with overflow was extremely slow and seemed to have gpu utilization close to zero.

rG223 avatar Feb 01 '22 13:02 rG223

@rG223 Please help to provide more details. Thanks.

chaoyanghe avatar Apr 16 '22 16:04 chaoyanghe