Long-Context-Data-Engineering It seems the result we get is not the same as the repo shows

It seems the result we get is not the same as the repo shows

Open linbeyoung opened this issue 4 months ago • 12 comments

this is the result we get with the code in this repo. we follow the readme step by step, making sure the environment, model and requirement are the same with the repo, but we are puzzled that we can not have the same score, especially at about 4k-tokens where the score is very low. 我们使用仓库的源代码，模型和环境进行复现，得到的结果是上面这张图，想请问一下可能是哪里出现问题？

Feb 20 '24 16:02 linbeyoung

Long-Context-Data-Engineering Long-Context-Data-Engineering copied to clipboard

It seems the result we get is not the same as the repo shows

Long-Context-Data-Engineering
Long-Context-Data-Engineering copied to clipboard