647sherry

> @647sherry Our training was conducted on 8 A100-80G GPUs, which is twice as large as your setup. For larger models, you could try reducing per_device_train_batch_size as needed, and increase...

I have the same problem with smartcreation/bge-large-zh-v1.5 (pulled from Ollama).