Khachatur

Results 2 comments of Khachatur

@q225yang How long did it take to finetune model with the updated vocabulary, until it reached sufficiently low loss for you ? Isn't it almost equivalent to pretraining if you...