fastmoe
fastmoe copied to clipboard
A bug in switch_gate
Describe the bug
In fmoe/gates/switch_gate.py
line 45:
capacity = math.ceil(cap_rate * inp.shape[0])
should be:
capacity = math.ceil(cap_rate * inp.shape[0] / self.num_expert)
?