Yancheng Wang

Results 1 comments of Yancheng Wang

> 实验中我们一般会降到4-5左右,可以再多训练一下 Do you have any suggestions on the number of epochs in training bge-large-en? We are using a 200 GB dataset. And is there an expected training loss in...