mengzi-retrieval-lm icon indicating copy to clipboard operation
mengzi-retrieval-lm copied to clipboard

Unable to reproduce Langboat/ReGPT-125M-200G‘s PPL result.

Open dumpmemory opened this issue 2 years ago • 2 comments
trafficstars

I have followed the training data setting as discussed in #9 and i used training the model with 200 retrieval index and https://github.com/Langboat/mengzi-retrieval-lm/blob/main/train/config.json. But I can't reproduce the ppl as pretrained model Langboat/ReGPT-125M-200G. Is there any thing i missing ?

Thanks.

dumpmemory avatar Dec 28 '22 08:12 dumpmemory

We checked the parameters in config.json, learning_rate should be set to 2e-4.

More detailed model parameters can be found in training_args.bin in huggingface.co/Langboat/ReGPT-125M-200G model file.

Thanks for pointing this out!

bling0830 avatar Dec 29 '22 10:12 bling0830

Thanks for your reply. I will try again.

dumpmemory avatar Dec 29 '22 14:12 dumpmemory