accelerate icon indicating copy to clipboard operation
accelerate copied to clipboard

feat:Add dataset truncation to prevent deadlocks during training

Open ved1beta opened this issue 6 months ago • 0 comments

What does this PR do?

  • Add truncate_dataset and gradient_steps parameters to BatchSamplerShard
  • Implement optimal dataset size calculation
  • Add truncation logic to both split and non-split batch iterators Fixes #2019

Before submitting

  • [X] Was this discussed/approved via a Github issue or the Please add a link to it if that's the case.
  • [X] Did you write any new necessary tests?

Who can review?

@SunMarc @muellerzr

ved1beta avatar Jun 08 '25 10:06 ved1beta