open_llama icon indicating copy to clipboard operation
open_llama copied to clipboard

What learning rate was used to pretrain 3B model?

Open itsnamgyu opened this issue 7 months ago • 0 comments
trafficstars

itsnamgyu avatar Apr 18 '25 16:04 itsnamgyu