Yancheng Wang comments




Yancheng Wang

bge small continued pre-training loss

> In our experiments the loss usually drops to around 4-5; you could train a bit longer.

Do you have any suggestions on the number of epochs for training bge-large-en? We are using a 200 GB dataset. And is there an expected training loss in...