Shashank Goel
Results
1
issues of
Shashank Goel
I have trained the model (both MLP and GPT-2) using the CC3M dataset but the loss doesn't seem to decrease very much (stays around 3.0). What loss can I expect...