text2vec
Question about BGE fine-tuning
Describe the Question
Hello, roughly how many examples (triplets) are in the Chinese STS-B dataset you used to train and evaluate the fine-tuned BGE model?
The data has been released: https://github.com/shibing624/text2vec/blob/master/examples/data/bge_finetune_data.jsonl
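To answer the size question directly, the released file can be counted after downloading it. The following is a minimal sketch, not code from the repository: it assumes the file is saved locally as `bge_finetune_data.jsonl`, and it assumes each line is a JSON object with `query`, `pos`, and `neg` fields (the usual BGE fine-tuning layout, not confirmed in this thread). The function name `count_records` is hypothetical.

```python
import json
from pathlib import Path


def count_records(path):
    """Return (n_records, n_triplets) for a JSONL file.

    Assumption: each non-empty line is a JSON object with a "query" string
    and "pos" / "neg" lists. A record contributes len(pos) * len(neg)
    (query, positive, negative) triplets.
    """
    n_records = 0
    n_triplets = 0
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)
            n_records += 1
            n_triplets += len(record.get("pos", [])) * len(record.get("neg", []))
    return n_records, n_triplets


if __name__ == "__main__":
    # Hypothetical local path: download the released file here first.
    data_path = Path("bge_finetune_data.jsonl")
    if data_path.exists():
        records, triplets = count_records(data_path)
        print(f"records: {records}, triplets: {triplets}")
    else:
        print(f"{data_path} not found; download it from the repository first.")
```

If the file uses different field names, only the two `record.get(...)` keys need to change.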
How the samples were constructed:
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. (The bot closes issues automatically after a long period of inactivity; feel free to ask again if needed.)