composer Augment training batches with "on-the-fly" features

Augment training batches with "on-the-fly" features

Open Riccorl opened this issue 1 year ago • 0 comments

trafficstars

For my use case, I would like to augment the training data with features produced by the model itself. More specifically, my experiment is structured as follows:

Train the model for n steps, after which an Evaluation iteration is performed
Before continuing training, the training set (or the next portion before the next eval step) passes through the model again.
Add the prediction of the model to the training data before the next training iteration

I implemented a Callback for the second step that runs at the end of the evaluations (Event.EVAL_AFTER_ALL) but I'm struggling in propagating the prediction back to the training dataloader. Things that I have tried so far:

Add the prediction directly to the underlying dataset
Having a "shared" object (singleton for now) that stores the predictions

The issues are that (1) the training iterator is not "reloaded" before the end of the epoch and (2) the subprocesses where the Singleton is updated are not the same as those where the batches are computed

Can you provide some guidance?

Apr 03 '24 09:04 Riccorl

composer composer copied to clipboard

Augment training batches with "on-the-fly" features

composer
composer copied to clipboard