Results 14 comments of xmyu28

deepspeed 0.14.2 transformers 4.40.0. torch 2.1.2 torchvision 0.16.2

my torch version is torch 2.1.2+cu121

02/27 03:06:07 - mmengine - INFO - before_train in EvaluateChatHook. hidden s, am torch.Size([1, 40, 4096]) torch.Size([1, 1, 40, 40]) shape torch.Size([1, 1, 40, 40]) torch.Size([1, 32, 40, 40]) torch.Size([1,...