Mor Zusman
Opened a PR fixing this issue https://github.com/microsoft/DeepSpeed/pull/2828
@zhen-jia I also encountered this bug and fixed it by simply changing ATTN_THREADS to 512.
> May I get an update regarding the status of this PR? It seems the author stopped working on it?

We're currently still working on it. The PR works well,...
AFAIU CI distributed-tests-2-gpus test fails regardless of this PR.
> QQ: Does this PR support parallel sampling (i.e., `n` > 1 in sampling params)? While I don't think it is not necessary to support parallel sampling in this PR,...
Tests failed due to timeouts to HF. Ready to be merged.