austinlaurice

Results 1 issues of austinlaurice

Is this a normal behavior that loss and prior became nan soon after the training process started? I ran the sample code of 20 news group.