uniem issues

Code crashes partway through a run with CUDA out of memory

1

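### 🐛 Bug description Fine-tuning suddenly hits OOM partway through. Do I need to limit the input length? Does the code truncate inputs internally? I currently place no limit on input length. torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 96.00 MiB (GPU 0; 31.74 GiB total capacity; 27.71 GiB already allocated; 91.12 MiB free; 31.22 GiB...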

susht3

bug
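One way to rule out over-long inputs as the cause of the OOM above is to cap text length before the data reaches the trainer. The sketch below is a minimal pre-processing step, not uniem's own behavior: the source does not say whether uniem truncates internally. The helper names, the 512-character limit, and the `text` / `text_pos` field names are assumptions for illustration, and character count is only a rough proxy for token count.

```python
from typing import Dict, List

MAX_CHARS = 512  # hypothetical limit; tune it to your model and GPU memory


def truncate_text(text: str, max_chars: int = MAX_CHARS) -> str:
    """Return text cut to at most max_chars characters."""
    return text[:max_chars]


def truncate_records(records: List[Dict[str, str]], max_chars: int = MAX_CHARS) -> List[Dict[str, str]]:
    """Truncate every string field of each record; non-string fields pass through."""
    return [
        {key: truncate_text(value, max_chars) if isinstance(value, str) else value
         for key, value in record.items()}
        for record in records
    ]


# Field names below are assumed for illustration only.
records = [{"text": "a" * 2000, "text_pos": "short"}]
clipped = truncate_records(records)
```

Peak activation memory grows with the longest sequence in a batch, so a single very long record can trigger an OOM many steps into a run even when earlier batches fit.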

Question

3

### 🐛 Bug description If the data contains only an input and a positive example, what is the loss function? ### Python Version None

haizeiwanglf

bug
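For pair data with only an input and a positive, a common choice is a softmax contrastive loss with in-batch negatives: the positives of the other records in the batch serve as negatives. The sketch below shows that technique in plain Python. It is an assumption that this matches what uniem does for such data; the function names and the temperature value are illustrative.

```python
import math
from typing import List

Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def in_batch_softmax_loss(texts: List[Vector], positives: List[Vector],
                          temperature: float = 0.05) -> float:
    """Mean cross-entropy where positives[i] is the correct match for texts[i]
    and every other positives[j] in the batch acts as a negative."""
    losses = []
    for i, text_vec in enumerate(texts):
        logits = [cosine(text_vec, pos_vec) / temperature for pos_vec in positives]
        max_logit = max(logits)  # subtract the max for numerical stability
        log_sum_exp = max_logit + math.log(sum(math.exp(l - max_logit) for l in logits))
        losses.append(log_sum_exp - logits[i])
    return sum(losses) / len(losses)


# Toy embeddings: each text is aligned with its own positive.
aligned = in_batch_softmax_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
# Same embeddings with the positives swapped, so every match is wrong.
swapped = in_batch_softmax_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

In this formulation a batch of size one has no negatives, so the loss is exactly zero regardless of the embeddings.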

Could you document the GPU requirements?

3

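### 🚀 The feature Could the documentation list GPU requirements? For example, when fine-tuning on datasets of a given size, how much GPU memory do m3e-small, m3e-base, and m3e-large need? Can a single card such as a 4080 16G or a 3090 24G run it? Those of us on a budget don't have 48G or 80G cards. Many thanks in advance for your replies.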

pdwfree

enhancement
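As a rough lower bound for the question above, the static training memory (weights, gradients, and optimizer state) can be estimated from the parameter count alone. The sketch assumes full fp32 training with an Adam-style optimizer, and the parameter counts are approximate figures assumed from the m3e model card; verify both before relying on the numbers. Activation memory, which scales with batch size and sequence length and often dominates, is deliberately excluded.

```python
# fp32 weights (4 bytes) + fp32 gradients (4 bytes) + two Adam moment buffers (8 bytes)
BYTES_PER_PARAM = 4 + 4 + 8

# Approximate parameter counts; assumed values, not taken from this issue.
PARAM_COUNTS = {
    "m3e-small": 24_000_000,
    "m3e-base": 110_000_000,
    "m3e-large": 340_000_000,
}


def static_training_gib(num_params: int) -> float:
    """GiB needed for weights, gradients, and Adam state, excluding activations."""
    return num_params * BYTES_PER_PARAM / 2**30


estimates = {name: static_training_gib(count) for name, count in PARAM_COUNTS.items()}
for name, gib in estimates.items():
    print(f"{name}: about {gib:.2f} GiB before activations")
```

Whether a given card is enough then depends mostly on batch size and maximum sequence length, which this estimate does not capture.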

Checkpoint model cannot be loaded

3

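### 🐛 Bug description The saved checkpoint directory seems to be missing files. Why does it contain only 3 files, while a complete model directory has 6? This is the complete model directory: ### Python Version None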

gctian

bug

Single-node multi-GPU run fails with: has parameters that were not used in producing loss

6

### 🐛 Bug description **Command used** CUDA_VISIBLE_DEVICES=2,3 accelerate launch --num_processes 2 path_to_train_m3e.py path_to_model path_to_dataset \ --output-dir output_dir **Error message** RuntimeError: Expected to have finished reduction in the prior iteration before starting a...

whi497

bug

Some questions about fine-tuning

3

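### 🐛 Bug description 1. How much data does fine-tuning m3e-base need before it has an effect? I trained on roughly [number missing in source] examples and it seems to have made no difference: the L2 distance between the embeddings is the same before and after fine-tuning. 2. The loss is 0 during fine-tuning. 3. Take "合同签订前" (before the contract is signed) and "合同签订后" (after the contract is signed). With m3e these two currently come out semantically very close, but in my business scenario they should be the furthest apart. Much of my fine-tuning data is like this: the suffixes differ in meaning while the overall semantics are close. Can this be fine-tuned? Thanks ### Python Version None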

zhouzhou0322
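To check whether fine-tuning actually moved a pair such as the two contract phrases above, the L2 distance between their embeddings can be compared before and after training. The sketch below uses plain Python with hypothetical helper names and toy placeholder vectors; in practice the four vectors would come from encoding the same two sentences with the original and the fine-tuned model.

```python
import math
from typing import Sequence


def l2_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def distance_shift(before_a: Sequence[float], before_b: Sequence[float],
                   after_a: Sequence[float], after_b: Sequence[float]) -> float:
    """Change in pair distance; positive means the pair moved apart after fine-tuning."""
    return l2_distance(after_a, after_b) - l2_distance(before_a, before_b)


# Toy placeholder vectors, not real model outputs.
shift = distance_shift([0.0, 0.0], [3.0, 4.0], [0.0, 0.0], [6.0, 8.0])
unchanged = distance_shift([0.0, 0.0], [3.0, 4.0], [0.0, 0.0], [3.0, 4.0])
```

A shift of exactly zero across many pairs suggests the weights being evaluated did not change, for example because the original model was loaded instead of the fine-tuned checkpoint, or because a loss of 0 produced no gradient.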