afalf issues

Results 5 issues of


                                            afalf

mkqa 是如何转换成检索数据集的

MKQA本身数据集并不包含corpus，请问是如何和nq的corpus做对应的呢

[Bug]: Can not download PaulGrahamEssayDataset

### Bug Description Can not download the PaulGrahamEssayDataset. Meet the connection timeout error, and I also can not visit the llamdahub libary url: https://raw.githubusercontent.com/nerdai/llama-hub/datasets/llama_hub/llama_datasets/library.json ### Version llama-index-0.10.30 ### Steps to...

bug

triage

LV-EVAL评测脚本？

您好，看到您关于长下文Qwen-Agent中提到对LV-EVAL的评测脚本做了修改，可以麻烦提供一下修改后的评测脚本么，想复现一下Qwen-Agent的长上下文效果测试，感谢🙏

Save_prediction overwrite for multi eval_split tasks.

I noticed that the Save_prediction path does not account for the eval_split. Consequently, if a task involves multiple eval_splits, the saved predictions will be overwritten.

bug

LongMemEval results

Hello, LongMemEval is a new benchmark for evaluating the capabilities of memory frameworks. Compared to LoCoMo, it better reflects real-world applications, particularly user-assistant conversations. I was wondering if you have...