shaochangxu
Results
1
issues of
shaochangxu
According to the step, i run ../examples/pytorch/glm/glm_server.sh on A100 * 8 and i get 2s with one sentence.  But when i start two server A and B, infer time...