Mor Zusman

Results 7 comments of Mor Zusman

Opened a PR fixing this issue https://github.com/microsoft/DeepSpeed/pull/2828

@zhen-jia I also encountered this bug, fixed it by simply changing ATTN_THREADS to 512

> May I get an update regarding the status of this PR? It seems the author stopped working on it? We're currently still working on it, The PR works well,...

AFAIU CI distributed-tests-2-gpus test fails regardless of this PR.

> QQ: Does this PR support parallel sampling (i.e., `n` > 1 in sampling params)? While I don't think it is not necessary to support parallel sampling in this PR,...

Tests failed due to timeouts to HF Ready to be merged