The MKQA dataset itself does not include a corpus. How did you map it to the NQ corpus?
### Bug Description
I cannot download the PaulGrahamEssayDataset. The download fails with a connection timeout error, and I also cannot reach the LlamaHub library URL: https://raw.githubusercontent.com/nerdai/llama-hub/datasets/llama_hub/llama_datasets/library.json

### Version
llama-index-0.10.30

### Steps to...
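Hello, I saw that in your write-up on long-context Qwen-Agent you mentioned modifying the LV-EVAL evaluation script. Could you please share the modified script? I would like to reproduce the Qwen-Agent long-context evaluation results. Thank you 🙏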
I noticed that the Save_prediction path does not account for the eval_split. Consequently, if a task involves multiple eval_splits, the saved predictions will be overwritten.
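One possible fix, sketched under assumptions: include the split name in the file name so that each split writes to its own file. The function `prediction_path`, its parameters, and the `_predictions.json` suffix are hypothetical names chosen for illustration and are not taken from the project's code.

```python
from pathlib import Path


def prediction_path(output_dir: str, task_name: str, eval_split: str) -> Path:
    """Build a per-split prediction path (hypothetical helper).

    Including eval_split in the file name keeps predictions from
    different splits of the same task from overwriting each other.
    """
    # Replace path separators so a split name cannot create subdirectories.
    safe_split = eval_split.replace("/", "_")
    return Path(output_dir) / f"{task_name}_{safe_split}_predictions.json"


# Each split of the same task now maps to a distinct file.
paths = [prediction_path("results", "MyTask", split) for split in ("dev", "test")]
```

With this layout, a task evaluated on both `dev` and `test` produces two separate files instead of one file that is written twice.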
Hello, LongMemEval is a new benchmark for evaluating the capabilities of memory frameworks. Compared to LoCoMo, it better reflects real-world applications, particularly user-assistant conversations. I was wondering if you have...