mesh
Data and model parallelism in the mnist.py example: unexpected nvprof results
I modified mnist.py from the examples section, following the documentation on GitHub, to run it once with data parallelism and once with model parallelism, and collected an nvprof profile for each run. The results look off: peer-to-peer (P2P) transfers show up in the data-parallel run but not in the model-parallel run. I went back, checked my changes, and regenerated the profiles, but the results are the same. Both runs used 4 GPUs. I am attaching screenshots of the nvprof output as well as the nvprof files themselves.
Model parallelism nvprof file: https://drive.google.com/open?id=1omQ_neb7eUgmDRnYMmLUyKzD2inO4Kai
Data parallelism nvprof file: https://drive.google.com/open?id=1MHGdzexNIcV9L66x1VkUQ11DBcM5H_qv
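
For context, the two configurations are usually distinguished by which tensor dimension is split across the processors. The following is a minimal pure-Python sketch of that idea, not the actual changes made to mnist.py and not the library's API. It assumes a 1-D mesh of 4 processors, and the dimension names (batch, hidden, io), the mesh dimension name (all_processors), and all sizes are hypothetical values chosen for illustration.

```python
# Hypothetical sizes and names, chosen only for illustration.
MESH_SHAPE = {"all_processors": 4}  # assumed 1-D mesh over the 4 GPUs

TENSORS = {
    "activations": {"batch": 512, "hidden": 1024},
    "weights": {"io": 784, "hidden": 1024},
}

# A layout maps a tensor dimension name to the mesh dimension it is split over.
DATA_PARALLEL = {"batch": "all_processors"}    # split the batch dimension
MODEL_PARALLEL = {"hidden": "all_processors"}  # split the hidden dimension


def shard_shape(dims, layout, mesh_shape):
    """Return the per-device size of each tensor dimension under a layout."""
    shape = {}
    for name, size in dims.items():
        mesh_dim = layout.get(name)
        # Dimensions not named in the layout stay whole on every device.
        parts = mesh_shape[mesh_dim] if mesh_dim else 1
        if size % parts:
            raise ValueError(f"{name}={size} is not divisible by {parts}")
        shape[name] = size // parts
    return shape


def describe(layout):
    """Per-device shapes of all example tensors under the given layout."""
    return {t: shard_shape(d, layout, MESH_SHAPE) for t, d in TENSORS.items()}


data_shards = describe(DATA_PARALLEL)
model_shards = describe(MODEL_PARALLEL)
print("data parallel: ", data_shards)
print("model parallel:", model_shards)
```

Under these assumptions, the data-parallel layout keeps a full copy of the weights on every device and splits only the batch, while the model-parallel layout splits the weights along the hidden dimension and keeps the full batch on every device.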