Daya Khudia
Daya Khudia
Summary: For debugging failure in https://github.com/pytorch/FBGEMM/runs/3098255308 Differential Revision: D29784946
Summary: Windows tests sometimes fail with illegal instructions. I think this should fix it. I believe storeCRegs was getting picked from avx512 implementation. Reviewed By: efiks Differential Revision: D23034784
Adding a transformation in the pipeline once the loader object is constructed results in the following error. ``` Traceback (most recent call last): File "/usr/lib/python3.8/threading.py", line 932, in _bootstrap_inner self.run()...
Conda env creates opencv4.pc (used by pkg-config) while apt install libopencv-dev creates opencv.pc. See https://github.com/libffcv/ffcv/issues/182 for more details.
Adds support for multi-device training using torch_xla. The resnet9 on cifar10 trains fine on 2 GPUs using torch_xla. Some of the issues I ran into while adding this support: -...
This PR adds torch 2.0 based tensor parallel support for the ffn block. It's ported over from https://github.com/mosaicml/examples/pull/255 Currently the trained weights don't match between parallel/no-parallel versions even in a...