composer icon indicating copy to clipboard operation
composer copied to clipboard

Add Low Precision LayerNorm

Open nik-mosaic opened this issue 3 years ago • 0 comments
trafficstars

[WIP] Will replace Fused LayerNorm, since Fused LayerNorm's speedup comes from running in low precision mode. Equivalent converge performance has been verified on standard NLP models (Bert, GPT).

Next commits:

  • Resolve type issues
  • Add test: tests/algorithms/test_low_precision_layernorm.py
  • Remove FusedLayerNorm code.

nik-mosaic avatar Sep 13 '22 10:09 nik-mosaic