mengzi-retrieval-lm Unable to reproduce PPL for GPT-Neo-125M using lm-eval

Unable to reproduce PPL for GPT-Neo-125M using lm-eval

Open pedrogengo opened this issue 2 years ago • 1 comments

trafficstars

Hey!

I'm trying to run the following command using the lm-eval cli, but I can't reproduce the results you shared. Did you do something different? If not, do you have any idea where I'm doing wrong?

python main.py \
	--model gpt2 \
	--model_args pretrained=EleutherAI/gpt-neo-125M \
	--device 0 \
	--tasks wikitext \
	--batch_size 1

Dec 13 '22 21:12 pedrogengo

May I ask what is the result of the EleutherAI/gpt-neo-125? Did you get an error when using it?

Dec 27 '22 05:12 bling0830

mengzi-retrieval-lm mengzi-retrieval-lm copied to clipboard

Unable to reproduce PPL for GPT-Neo-125M using lm-eval

mengzi-retrieval-lm
mengzi-retrieval-lm copied to clipboard