VBench icon indicating copy to clipboard operation
VBench copied to clipboard

long prompt evaluation

Open ywlq opened this issue 4 months ago • 1 comments

I am using the augment's long prompt for evaluation. Since the prompt is too long, I'm using the corresponding short prompt as the filename. When conducting the evaluation, should I directly use the default prompt_file from evaluate_long.sh, or do I need to customize the prompt_file? If customization is needed, what should be the format of the prompt_file?

ywlq avatar Sep 11 '25 04:09 ywlq

Hi, for the filename, please use the original short prompt directly, since that’s what the evaluation pipeline will look up.

ziqihuangg avatar Sep 17 '25 02:09 ziqihuangg