Bruising6802
In `the_annotated_transformer.py` on line `357`, the function documentation even says that the norm was moved.
Maybe it's best to mention this issue in the notebook, because it causes confusion for many.
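For anyone else tripping over this, here is a rough sketch of the two orderings as I understand them. The paper describes each sub-layer as `LayerNorm(x + Sublayer(x))`, while my reading of the docstring is that the code applies the norm before the sublayer instead. Everything below is a toy illustration in plain Python, not the repository's code: `layer_norm`, `toy_sublayer`, `post_norm` and `pre_norm` are hypothetical names, the norm has no learned scale or bias, and dropout is left out.

```python
import math


def layer_norm(x, eps=1e-6):
    # Simplified layer norm over one vector (no learned gain/bias; an assumption).
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def toy_sublayer(x):
    # Hypothetical stand-in for self-attention or the feed-forward block.
    return [2.0 * v + 1.0 for v in x]


def post_norm(x, sublayer):
    # Ordering described in the paper: LayerNorm(x + Sublayer(x)).
    return layer_norm([a + b for a, b in zip(x, sublayer(x))])


def pre_norm(x, sublayer):
    # Ordering with the norm moved first: x + Sublayer(LayerNorm(x)).
    return [a + b for a, b in zip(x, sublayer(layer_norm(x)))]


x = [1.0, 2.0, 4.0]
post_out = post_norm(x, toy_sublayer)
pre_out = pre_norm(x, toy_sublayer)
print("post-norm:", post_out)
print("pre-norm: ", pre_out)
```

The two outputs differ: with post-norm the result is itself normalized, whereas with pre-norm the raw input is carried through the residual path unnormalized. That difference is probably why the swap confuses people comparing the code with the paper.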