llama2.c Code/script to reproduce val loss using the shared models

Code/script to reproduce val loss using the shared models

Open Alexey234432 opened this issue 1 year ago • 3 comments

Hi,

does anyone know if there is a script/code to reproduce val loss using provided "*.bin" models? I've tried myself and can't get the numbers shared.

Thank you.

Jan 25 '24 18:01 Alexey234432

Same issue here.

Feb 05 '24 12:02 DavidHerel

in my case loss values are slightly higher - is it the same for you? ie 1.072 for 15M model is my case is 1.0833 and 0.760 for 110M model jumps to 0.8725 @DavidHerel

Thank you

Feb 05 '24 18:02 Alexey234432

Yeah, I think it was something similar to you.

I did not play with lr, warmup and dropout, so maybe more extensive hyperparams search will get us the results?

Feb 06 '24 11:02 DavidHerel

llama2.c llama2.c copied to clipboard

Code/script to reproduce val loss using the shared models

llama2.c
llama2.c copied to clipboard