Artem Kuzmitckii
Results
2
issues of
Artem Kuzmitckii
Fixes for regular and distributed unit tests, including cuda&native code. Also including partial cherry-pick of [release/2.5][SWDEV-489778] NAVI4x UT parity for distributed config (https://github.com/ROCm/pytorch/pull/2327) Fixes #SWDEV-523736
The patch delivers several fixes for building issues for CUDA part of DeepSpeed library. Percentage of passed unit tests improved(tested on RDNA hardware, gfx110x and gfx12x) Before: collected 5298 items...