Results: 2 comments of 647sherry
> @647sherry Our training was conducted on 8 A100-80G GPUs, which is twice the size of your setup. For larger models, you could try reducing per_device_train_batch_size as needed, and increase...
I have the same problem with smartcreation/bge-large-zh-v1.5 (pulled from Ollama).