Brendan Murphy

Results 2 comments of Brendan Murphy

@haileyschoelkopf I'm interested in taking this on if no one is currently working on it.

Results from running on llama2-7b: ``` lm_eval --model hf --model_args pretrained=meta-llama/Llama-2-7b-chat-hf --tasks commonsense_qa | Tasks |Version|Filter|n-shot|Metric|Value | |Stderr| |--------------|-------|------|-----:|------|-----:|---|-----:| |commonsense_qa|Yaml |none | 0|acc |0.5815|± |0.0141| ```