GLM-130B
Inference with 6 cards
When I tried to run inference with 6 cards, I got AssertionError: 32768 is not divisible by 6.
I found that 32768 comes from inner-hidden-size.
Why does 32768 have to be divisible by the number of cards?
What should I do to make it run with 6 cards?
Are your 6 cards part of a cluster deployment?
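On the assertion itself: 32768 is 2^15, so only power-of-two card counts divide it evenly. Below is a minimal sketch of that arithmetic, assuming the check is a plain divisibility test of inner-hidden-size by the number of cards, as the error message suggests. The function names are hypothetical and are not taken from the GLM-130B code.

```python
from typing import List

# Value reported in the error message (inner-hidden-size).
INNER_HIDDEN_SIZE = 32768  # equals 2**15


def divides_evenly(size: int, num_cards: int) -> bool:
    """Return True if `size` can be split into `num_cards` equal shards."""
    return size % num_cards == 0


def usable_card_counts(size: int, max_cards: int) -> List[int]:
    """List every card count from 1 to `max_cards` that divides `size`."""
    return [n for n in range(1, max_cards + 1) if divides_evenly(size, n)]


if __name__ == "__main__":
    print(divides_evenly(INNER_HIDDEN_SIZE, 6))      # False
    print(usable_card_counts(INNER_HIDDEN_SIZE, 6))  # [1, 2, 4]
```

With 6 cards available, the largest count that passes this kind of check is 4. Whether 4 cards have enough memory for the model is a separate question.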