ranzhejiang

Results 2 issues of ranzhejiang

Set `shuffle=True` by default in data_sampler, the discuss is in [#5619](https://github.com/microsoft/DeepSpeed/issues/5619)

- To meet the customer's demands, we need to add the classification function interface of the qwen2 on gaudi - you can test it with following code ```python import torch...

run-test