hezq06

Results 1 issues of hezq06

**Describe the bug** The traditional way of model.eval() seems doesn't work with DeepSpeed Transformer Kernel. The training flag is changed, however, the randomness is still there. **To Reproduce** I've made...

bug
inference