apex
apex copied to clipboard
Dose Apex support Transformer or Vision Transformer?
Dose Apex support Transformer or Vision Transformer considering the existence of Layer Norm layers and statistics synchronous across GPUs?