Zach Mueller comments

Results 471 comments of


                                            Zach Mueller

Feature Request: Support MS-AMP

@winglian not quite yet! But I'll let you know for you to test :) (should be by end of this week!)

Feature Request: Support MS-AMP

@winglian go ahead and try the branch out :) Note that it only works on single GPU for now (will look at deepspeed tommorow), and you shouldn't see a time...

Feature Request: Support MS-AMP

Correct. I only tested on a tiny model just to get the API stable 😉

Feature Request: Support MS-AMP

Now that it’s a bit more stable, I saw both memory decreases and speed increases when combining MS-AMP and TransformerEngine. More details are in the PR (so overall purely positives)

Feature Request: Support MS-AMP

Correct, I'm looking into that this week

Barebones dataloader to allow for any type of iterable dataloader-like object to be used. Should just handle device placement

@alex-jw-brooks the idea behind this is indeed as you say :) Flag would be better, and do note that realistically `dispatch_batches` or `split_batches` shouldn't do *anything*, this is full user...

Zach Mueller

Feature Request: Support MS-AMP

Feature Request: Support MS-AMP

Feature Request: Support MS-AMP

Feature Request: Support MS-AMP

Feature Request: Support MS-AMP

Barebones dataloader to allow for any type of iterable dataloader-like object to be used. Should just handle device placement

Assertion `index >= -sizes[i] && index < sizes[i] && "index out of bounds"` failed.

Assertion `index >= -sizes[i] && index < sizes[i] && "index out of bounds"` failed.

Gradient accumulation yields worse results than the equivalent batch size

Gradient accumulation yields worse results than the equivalent batch size