Runfa Chen

3 comments by Runfa Chen

This may be a case of model collapse. Try adjusting the training schedule to suit the dataset, and save checkpoints so that they can be inspected manually...

ce = nn.CrossEntropyLoss(reduction='none')(predict_prob_source, label_source)

Why feed the post-softmax output (predict_prob_source) to nn.CrossEntropyLoss? This criterion already combines LogSoftmax and NLLLoss, so the softmax is effectively applied twice.
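To make the concern concrete, here is a minimal plain-Python sketch of what happens per sample. It assumes predict_prob_source already holds softmax probabilities, as the comment suggests. The helper names and the example logits are hypothetical and are not taken from the repository; cross_entropy_from_scores only mirrors the LogSoftmax + NLLLoss computation and is not the PyTorch implementation itself.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of raw scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy_from_scores(scores, label):
    # Mirrors LogSoftmax followed by NLLLoss for one sample:
    # loss = log(sum(exp(scores))) - scores[label]
    m = max(scores)
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scores))
    return log_sum_exp - scores[label]

def nll_from_probs(probs, label):
    # Negative log-likelihood when the input is already a probability vector.
    return -math.log(probs[label])

# Hypothetical logits for a 3-class problem; the true class is 0.
logits = [4.0, 1.0, 0.0]
label = 0
probs = softmax(logits)

loss_correct = cross_entropy_from_scores(logits, label)  # raw logits in
loss_double = cross_entropy_from_scores(probs, label)    # probabilities in: softmax twice
loss_from_probs = nll_from_probs(probs, label)           # valid alternative for probabilities

print(f"logits in:        {loss_correct:.4f}")
print(f"probabilities in: {loss_double:.4f}")
print(f"NLL on log-probs: {loss_from_probs:.4f}")
```

Because probabilities lie in [0, 1], the second softmax flattens the distribution: with K classes, the double-softmax loss can never drop below log(1 + (K - 1) / e), even for a perfectly confident correct prediction. If the probabilities are needed elsewhere, one option is to pass the pre-softmax output to nn.CrossEntropyLoss, or to apply nn.NLLLoss to the log of the probabilities instead.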