stylable
stylable copied to clipboard
Stylable - CSS for components
I wonder that how's the performance between byteps and butterfly? Can you offer more experiments of the performance? Thanks a lot.
After this PR, trainer.allreduce_grads will compute the average instead of the sum. Also, I think this line (https://github.com/bytedance/byteps/blob/master/byteps/mxnet/__init__.py#L322) is wrong, as it will ignore `self._scale = self._optimizer.rescale_grad` setting in the...
Support BytePS for MXNet 2.0. cc @eric-haibin-lin
Can byteps support sync batch normalization while train with multi GPU?
Hi, I'm curious about why the REDUCE time is different with BROADCAST time. data:image/s3,"s3://crabby-images/36769/36769d9ce48b6436a7aca757c5926498811b8dd0" alt="微信图片_20200720194020" And, the BROADCAST time of each partition is different with others. data:image/s3,"s3://crabby-images/d960f/d960f11eca13b389119e34635107df1cb3bd5c9e" alt="微信图片_20200720194223" Could the reason be...
When I train mnist with pytorch, I found the output accuracy and loss are werid. Then I tried to print it out before push_pull. data:image/s3,"s3://crabby-images/7489c/7489cc27369ece5878f96b38804137d038b4e46d" alt="Screen Shot 2020-07-18 at 8 24...
Does byteps support GNN training? Or graph computing?
**Describe the bug** I benchmarked BytePS and Horovod's performance using this [script" using 4VM * 8 V100 on TCP. It turned out that the performance I got from BytePS is...
**Describe the bug** Running all instances which are 2 workers, scheduler and server on one node with multiple gpus crashes when one of workers is asked to be run on...
Hi, My understanding is that Byteps uses ps-lite as a thirt party library. Is there any way the bounded delay (as described in ps-lite) could be changed using byteps? Thank...