Yue Zheng
Results
1
comments of
Yue Zheng
Hello, why is self.ln_pre used here instead of self.ln_post? In def encode_text, self.ln_final is used, which represents the output layer_norm.