sparseml Add support to total batch size argument for transformers transfer learning

Add support to total batch size argument for transformers transfer learning

Open anmarques opened this issue 3 years ago • 2 comments

Created the NMTrainingArguments that inherits from HF's TrainingArguments. This class allows one to add arguments to the training script and handling potential conflicts with other arguments. In particular, added "total_train_batch_size" and "total_eval_batch_size" and made sure these arguments cannot be used alongside "per_device_train_batch_size" and "per_device_eval_batch_size."

Feb 18 '22 23:02 anmarques

@alexm-nm given how many commits have happened since you set up this PR I would suggest either starting from scratch on new branch or resolving all the conflicts

Mar 30 '22 17:03 spacemanidol

This PR is ready but needs to be merged by hand. Will create a new PR soon and delete this one when done.

May 20 '22 18:05 anmarques

Too much time has passed. Will re-implement if needed

May 04 '23 17:05 anmarques

sparseml sparseml copied to clipboard

Add support to total batch size argument for transformers transfer learning

sparseml
sparseml copied to clipboard