NALU-pytorch
NALU-pytorch copied to clipboard
Clarification on Multiplication results
According to the functional learning results, NAC performs comparable to NALU on the multiplication task. However, there's no means that NAC can learn multiplication. Indeed in the original paper's results, NAC performs even worse than ReLU6. Do you have idea what's happening here?