Zhaoyue Cheng

Results 3 comments of Zhaoyue Cheng

I believe fine tuning can be done on a multi GPU system with accumulating gradients in PyTorch.

I tried to train with the default parameter, but I only get very low F1/ EM after a long time, F1 is around 10 after training for a long time....

yeah, ELMO gives boost for the Bidaf model like 4 points