
I found that the dim parameter affects the learning loss and n_layers affects the training speed.

Open win10ogod opened this issue 2 years ago • 0 comments

I found that the `dim` parameter affects the training loss, while `n_layers` affects the training speed. [Screenshot 2023-10-07 184924] [Screenshot 2023-10-07 185043] One run took 30 minutes. The run with the larger layer setting reached a loss of only 2, but it took 3 hours.
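To make the trade-off concrete, here is a rough sketch of how the parameter count of a Llama-style model grows with `dim` and `n_layers`. The helper name `approx_params` is hypothetical, and the formula rests on assumptions that are not stated in this issue: tied input/output embeddings, `n_kv_heads == n_heads`, a SwiGLU feed-forward block whose hidden size is about `8/3 * dim` rounded up to `multiple_of`, and a vocabulary of 32000 tokens. It is an estimate, not a measurement of these runs.

```python
def approx_params(dim, n_layers, vocab_size=32000, multiple_of=32):
    """Approximate parameter count of a Llama-style transformer (assumed layout)."""
    # Feed-forward hidden size: roughly 8/3 * dim, rounded up to a multiple of `multiple_of`.
    hidden = int(2 * (4 * dim) / 3)
    hidden = multiple_of * ((hidden + multiple_of - 1) // multiple_of)

    attn = 4 * dim * dim        # wq, wk, wv, wo (assumes n_kv_heads == n_heads)
    ffn = 3 * dim * hidden      # w1, w2, w3 of the SwiGLU block
    norms = 2 * dim             # two RMSNorm weight vectors per layer
    per_layer = attn + ffn + norms

    # Token embedding (assumed shared with the output head) + layers + final norm.
    return vocab_size * dim + n_layers * per_layer + dim


if __name__ == "__main__":
    # Compare a few hypothetical settings; these are not the configs from the screenshots.
    for dim, n_layers in [(288, 6), (288, 12), (576, 6)]:
        print(f"dim={dim:4d} n_layers={n_layers:2d} params~{approx_params(dim, n_layers):,}")
```

Under these assumptions, the per-layer cost grows with the square of `dim`, and the total grows linearly with `n_layers`, so either setting raises the compute per step. Adding layers also adds sequential work per step, which may be part of why the longer run was slower.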

win10ogod · Oct 07 '23 10:10