Kunal Chakrabarty
Results
3
comments of
Kunal Chakrabarty
I see a lot of warnings of the following nature 2022-07-30 01:29:25 | WARNING | root | Rolled back to use the default process group for the reduce scatter operation...
Is the intent of this issue to just rename epoch into something more meaningful and intuitive like shard_index?
Is the expectation of this task to have a check that prevents setting both these flags? Or is it to explore the underlying issue that causes it?