Sunburst7

Results 4 comments of Sunburst7

# bug report I found a bug when I try to train the model by following the README's instruction ```bash Traceback (most recent call last): File "hydit/train_deepspeed.py", line 531, in...

I also have the same problem when training customer datasets ![Image](https://github.com/user-attachments/assets/1b397f61-330f-4142-b1be-7452a829dcea)

> You can probably resolve the issue by setting the number of classes to n+1. For example, if you have 10 classes, set num_classes = 11 > > “nan" problem...