VaLM
VaLM copied to clipboard
Evaluation prompts
trafficstars
Hello! I'm trying to replicate your model, and while evaluating the model, I noticed that the prompts don't match those in the paper. Are these the final scripts? Thank you
Hi, one of the 9 prompts for color reasoning in code is different from the paper appendix. I will update the paper accordingly. Please use the prompts in evaluation_scripts/verify_color_prediction.py