llama2.c
llama2.c copied to clipboard
Code/script to reproduce val loss using the shared models
Hi,
does anyone know if there is a script/code to reproduce val loss using provided "*.bin" models? I've tried myself and can't get the numbers shared.
Thank you.
Same issue here.
in my case loss values are slightly higher - is it the same for you? ie 1.072 for 15M model is my case is 1.0833 and 0.760 for 110M model jumps to 0.8725 @DavidHerel
Thank you
Yeah, I think it was something similar to you.
I did not play with lr, warmup and dropout, so maybe more extensive hyperparams search will get us the results?