YL-9
Results
2
issues of
YL-9
Could you please tell me what the parameters for training each model in train_lm.sh are? Thank you!
你好,我读了context_manager的代码之后有几点问题,能否解答下: 1. get_global_hidden_and_mask 函数中,kv 拼接顺序为什么是 block+init+local,而不是 init+block+local 2. _append 函数中,global_h_q, global_h_k, global_h_v 似乎未使用 rope ~~(代码注释太少,,刚看起来参数都不知道什么含义~~