Bert-TextClassification
Bert-TextClassification copied to clipboard
gradient accumulation
Thanks for sharing the code. Your gradient accumulation implementation helps me a lot on my datasets (roughly >10% f1 improvements with very large batch size).
Please check line 87 of train_evaluate.py. I think it should be "train_steps" instead of "step".
Thanks