atlantic8
Same problem.
I only have training data in the format sentence1, sentence2, label, so I cannot construct training data in the format query=xxx, pos=[], neg=[].
Also, when I tried training with train.py using "--fp16 True --gradient_accumulation_steps 3", I ran out of GPU memory. I was using an A100 40G. Why does training this model take this...
By the way, I could only train this model with per_device_train_batch_size set to 2.
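
For reference, here is a rough sketch of the effective batch size those settings work out to. It assumes a single GPU (the device count is my assumption), and the function name is only illustrative, not something from train.py:

```python
def effective_batch_size(per_device_train_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of samples that contribute to each optimizer step.

    num_devices=1 is an assumption (a single A100); adjust for multi-GPU runs.
    """
    return per_device_train_batch_size * gradient_accumulation_steps * num_devices


# Settings from this comment: batch size 2 per device, 3 accumulation steps.
samples_per_step = effective_batch_size(2, 3)
print(samples_per_step)
```

Gradient accumulation raises the number of samples per optimizer step without raising peak activation memory, since only per_device_train_batch_size samples go through a forward/backward pass at a time.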