Results: 2 comments of larsupb
Should be fixed now
Sounds promising. Key innovations include:
- magnitude preservation principles
- controlled learning rate decay
- eliminating group normalization layers

But since there is also a need to modify the ADA...