Brendan Murphy
Results
2
comments of
Brendan Murphy
@haileyschoelkopf I'm interested in taking this on if no one is currently working on it.
Results from running on llama2-7b: ``` lm_eval --model hf --model_args pretrained=meta-llama/Llama-2-7b-chat-hf --tasks commonsense_qa | Tasks |Version|Filter|n-shot|Metric|Value | |Stderr| |--------------|-------|------|-----:|------|-----:|---|-----:| |commonsense_qa|Yaml |none | 0|acc |0.5815|± |0.0141| ```