LongICLBench
LongICLBench copied to clipboard
Qwen1.5 seems not using NTK
Hi~ Thanks for sharing the evaluation code! I would like to ask where did Qwen1.5 add ntk-interpolation? I traced the code with Qwen2 with huggingface transformers, and found they remove the ntk-interpolation (Qwen1 has ntk but Qwen2/1.5 don't have)
Qwen1.5 should only use RoPE, right?