nhatkhtn

Results: 2 comments by nhatkhtn

@wangyifei0047 I believe you are mistaken. In the current version (2.15.4), the normalization layer used at the start of each layer for Llama is **RMS**, not RMSPre as you pointed...
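
For readers unfamiliar with the distinction, here is a minimal sketch. It assumes "RMS" means root-mean-square normalization followed by a learnable per-dimension gain, and "RMSPre" means the same normalization without that gain. The function names `rms_norm` and `rms_norm_pre` are hypothetical, chosen for illustration, and are not the library's API.

```python
import math


def rms_norm_pre(x, eps=1e-5):
    """Rescale x to (approximately) unit root-mean-square; no learnable parameters."""
    mean_sq = sum(v * v for v in x) / len(x)
    scale = math.sqrt(mean_sq + eps)
    return [v / scale for v in x]


def rms_norm(x, weight, eps=1e-5):
    """Same normalization, followed by an elementwise learnable gain (weight)."""
    return [w * v for w, v in zip(weight, rms_norm_pre(x, eps))]


x = [3.0, 4.0]
normed = rms_norm_pre(x)                  # gain-free variant
scaled = rms_norm(x, weight=[2.0, 0.5])   # variant with a per-dimension gain
```

Under these assumptions the two variants differ only in the gain: with a weight vector of all ones, `rms_norm` returns the same values as `rms_norm_pre`.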