DeepSpeed
DeepSpeed copied to clipboard
Prevent hangs in CI during parallel run compilation
We are seeing random hangs, which we believe are caused by multiple pytest processes trying to compile the same code at once creating a deadlock. This PR sets a seperate TORCH_EXTENSIONS_DIR for each pytest process to avoid this.
@tjruwase