Bert-TextClassification gradient accumulation

gradient accumulation

Open Atomu2014 opened this issue 5 years ago • 0 comments

Thanks for sharing the code. Your gradient accumulation implementation helps me a lot on my datasets (roughly >10% f1 improvements with very large batch size).

Please check line 87 of train_evaluate.py. I think it should be "train_steps" instead of "step".

Thanks

Jan 17 '20 06:01 Atomu2014

Bert-TextClassification Bert-TextClassification copied to clipboard

gradient accumulation

Bert-TextClassification
Bert-TextClassification copied to clipboard