hezq06
Results: 1 issue by hezq06
**Describe the bug** The usual approach of calling model.eval() doesn't seem to work with the DeepSpeed Transformer Kernel. The training flag changes, but the output is still random. **To Reproduce** I've made...
bug
inference