OpenDiT
OpenDiT copied to clipboard
读取数据报错,numworker改成0才稳定,请问是什么问题?
Traceback (most recent call last):
File "xx/OpenDiT-master/train.py", line 383, in
How did you launch your script?
I think memory may leak, get bigger than 600G, so been killed
目前是发现跑几十万数据,内存一直增长
it about torch dataloader. you can use gc collect to avoid this problem