larsupb

Sounds promising. Key innovations include:

- magnitude preservation principles
- controlled learning rate decay
- eliminating group normalization layers

But since there is also a need to modify the ADA...