Yue Zheng

Results 1 comments of Yue Zheng

Hello, why is self.ln_pre used here instead of self.ln_post? In def encode_text, self.ln_final is used, which represents the output layer_norm.