examples
examples copied to clipboard
Testing a C++ case with MPI failed.
š Describe the bug
I am testing the following example:
https://github.com/pytorch/examples/blob/main/cpp/distributed/dist-mnist.cpp
I get the following error:
[ 50%] Building CXX object CMakeFiles/awcm.dir/xdist.cxx.o /home/alamj/TestCases/tests/xtorch/xdist/xdist.cxx:1:10: fatal error: c10d/ProcessGroupMPI.hpp: No such file or directory 1 | #include <c10d/ProcessGroupMPI.hpp>
I changed the top line with full path to ensure that hpp file gets available #include </project/def-alamj/shared/libtorch/include/torch/csrc/distributed/c10d/ProcessGroupMPI.hpp>
The new error indicates something else I need to know, which is given in the tutorial.
[ 50%] Building CXX object CMakeFiles/awcm.dir/xdist.cxx.o /home/alamj/TestCases/tests/xtorch/xdist/xdist.cxx:38:21: error: āc10dā was not declared in this scope; did you mean āc10ā? 38 | std::shared_ptrc10d::ProcessGroupMPI pg, | ^~~~ | c10
Please let me know how do I get a work around to fix this.
Error logs
No response
Minified repro
No response
Versions
I think this field is not needed as I am running C++ code.
cc @ezyang @msaroufim @bdhirsh @anijain2305 @zou3519