LLaVA
[Question] layer_norm float32?
Question
Is there a reason the 'norm' layers are kept in float32?
What happens if they are set to bfloat16 instead?
Would running the 'norm' operator in bfloat16 rather than float32 make the performance gap larger?
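
To make the question concrete, here is a rough sketch of the kind of precision difference I have in mind. It is not LLaVA code: `to_bf16`, `layer_norm_ref`, and `layer_norm_bf16` are hypothetical helpers, bfloat16 is emulated by rounding to the top 16 bits of a float32, and Python floats stand in for the full-precision reference. It also rounds every intermediate to bfloat16, which is probably pessimistic, since real kernels may accumulate in higher precision.

```python
import math
import struct


def to_bf16(x: float) -> float:
    """Round x to the nearest bfloat16 value (ties to even), returned as a float.

    Assumes x is finite and well inside the float32 range.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bfloat16 keeps the upper 16 bits of the float32 bit pattern.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + bias) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def layer_norm_ref(xs, eps=1e-5):
    """Layer norm without affine parameters, computed in full precision."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    return [(x - mean) * inv_std for x in xs]


def layer_norm_bf16(xs, eps=1e-5):
    """Same computation, but every intermediate is rounded to bfloat16."""
    r = to_bf16
    xs = [r(x) for x in xs]
    n = len(xs)
    total = 0.0
    for x in xs:
        total = r(total + x)
    mean = r(total / n)
    sq = 0.0
    for x in xs:
        d = r(x - mean)
        sq = r(sq + r(d * d))
    var = r(sq / n)
    inv_std = r(1.0 / math.sqrt(var + eps))
    return [r(r(x - mean) * inv_std) for x in xs]


# Activations with a large offset and a small spread: near 100 the
# spacing between adjacent bfloat16 values is 0.5, so the spread is lost.
xs = [100.0 + 0.01 * i for i in range(64)]
ref = layer_norm_ref(xs)
low = layer_norm_bf16(xs)
max_err = max(abs(a - b) for a, b in zip(ref, low))
print("max abs difference:", max_err)
```

With inputs like these the bfloat16 version loses most of the variation between elements, which is the effect I am asking about. Whether this matters in practice for the 'norm' layers in the model is exactly what I would like to know.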