sparseml icon indicating copy to clipboard operation
sparseml copied to clipboard

Add support to total batch size argument for transformers transfer learning

Open anmarques opened this issue 3 years ago • 2 comments

Created the NMTrainingArguments that inherits from HF's TrainingArguments. This class allows one to add arguments to the training script and handling potential conflicts with other arguments. In particular, added "total_train_batch_size" and "total_eval_batch_size" and made sure these arguments cannot be used alongside "per_device_train_batch_size" and "per_device_eval_batch_size."

anmarques avatar Feb 18 '22 23:02 anmarques

@alexm-nm given how many commits have happened since you set up this PR I would suggest either starting from scratch on new branch or resolving all the conflicts

spacemanidol avatar Mar 30 '22 17:03 spacemanidol

This PR is ready but needs to be merged by hand. Will create a new PR soon and delete this one when done.

anmarques avatar May 20 '22 18:05 anmarques

Too much time has passed. Will re-implement if needed

anmarques avatar May 04 '23 17:05 anmarques