mengzi-retrieval-lm
mengzi-retrieval-lm copied to clipboard
Unable to reproduce PPL for GPT-Neo-125M using lm-eval
trafficstars
Hey!
I'm trying to run the following command using the lm-eval cli, but I can't reproduce the results you shared. Did you do something different? If not, do you have any idea where I'm doing wrong?
python main.py \
--model gpt2 \
--model_args pretrained=EleutherAI/gpt-neo-125M \
--device 0 \
--tasks wikitext \
--batch_size 1
May I ask what is the result of the EleutherAI/gpt-neo-125? Did you get an error when using it?