lm-evaluation-harness feat: COT trace response handling in evaluator and model classes

feat: COT trace response handling in evaluator and model classes

Open hhh2210 opened this issue 4 months ago • 1 comments

Added support for storing raw generations in HFLM and VLLM models.
Updated the evaluator to log warnings when the length of raw generations does not match processed responses.
Modified response collection to include raw responses when available.

This improves the evaluation process by allowing access to the original generated outputs alongside processed responses.

Aug 03 '25 04:08 hhh2210

All committers have signed the CLA.

Aug 03 '25 04:08 CLAassistant