LiDAR-MOS
About the issue of multi-GPU training.
My server has four NVIDIA RTX 4090 GPUs. Single-GPU training runs without errors with the original settings, but when I change the batch size to 2 (still on a single GPU, with no other parameters changed), it throws an error after completing just one epoch. I also tried multi-GPU training, but it keeps throwing errors as well. I searched online for solutions, but none of them resolved the issue. The error message is as follows:
Traceback (most recent call last):
File "./train.py", line 186, in