DB-GPT-Hub
What does each column in the evaluation represent?
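Could you explain what each column represents? I see an overall score of 0.789 reported, but on the home page that value sits in a column labeled "median":
CodeLlama-13b-Instruct-hf_lora, 0.789, SFT trained by this project, using only the Spider train dataset, evaluated the same way as in this project with LoRA SFT. The weights have been published.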
CodeLlama-13B-Instruct
base   0.698  0.601  0.408  0.271  0.539
lora   0.94   0.789  0.684  0.404  0.746
qlora  0.94   0.774  0.626  0.392  0.727
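The columns correspond to Method, Easy, Medium, Hard, Extra, and All. Each difficulty level gets its own score, and All is the score over the whole evaluation set. So 0.789 is the Medium score for lora, not the overall one. For what the difficulty levels mean, see the Data Examples section at https://yale-lily.github.io/spider.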
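To make the mapping concrete, here is a minimal sketch that attaches the column names to the CodeLlama-13B-Instruct rows quoted in this thread. The names `COLUMNS`, `ROWS`, and `label_scores` are made up for illustration and are not part of DB-GPT-Hub.

```python
# Column order as described in the reply: Easy, Medium, Hard, Extra, All.
COLUMNS = ["Easy", "Medium", "Hard", "Extra", "All"]

# Scores copied from the CodeLlama-13B-Instruct rows quoted in this thread.
ROWS = {
    "base": [0.698, 0.601, 0.408, 0.271, 0.539],
    "lora": [0.94, 0.789, 0.684, 0.404, 0.746],
    "qlora": [0.94, 0.774, 0.626, 0.392, 0.727],
}


def label_scores(method):
    """Return a {column name: score} mapping for one method (hypothetical helper)."""
    return dict(zip(COLUMNS, ROWS[method]))


lora = label_scores("lora")
print(lora["Medium"])  # 0.789, the value quoted on the home page
print(lora["All"])     # 0.746, the score over all difficulty levels
```

Read this way, the 0.789 on the home page is the Medium column for lora, and the overall figure for the same row is 0.746.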