lightning-thunder icon indicating copy to clipboard operation
lightning-thunder copied to clipboard

remove jit(fsdp(model)) codepath

Open t-vi opened this issue 1 year ago • 6 comments

The old codepath is not composable with other transforms, does not offer gathering of state dicts as easily etc.

Removing, of course depends on NVIDIA benchmarking not needing it. I think we (@crcrpar actually) switched a couple of months ago or so.

@mpatel31415 @tfogal @crcrpar @IvanYashchuk wdyt?

t-vi avatar Sep 09 '24 15:09 t-vi

Brought up in triage review. Assigned to Tom V.

nvMelissa avatar Sep 09 '24 15:09 nvMelissa

I'd say the removal should be after https://github.com/Lightning-AI/lightning-thunder/issues/1051 is fixed.

crcrpar avatar Sep 09 '24 15:09 crcrpar

@crcrpar Thank you for pointing that out!

So what kind of delay should we have to be sure the benchmarking works without it?

t-vi avatar Sep 11 '24 11:09 t-vi

We already use the new node, which contains: distributed_first := (self.compile in ("eager", "inductor") or "dynamo" in self.compile):

so from Mixology perspective it should be fine.

mpatel31415 avatar Sep 11 '24 11:09 mpatel31415

Just to clarify my take then, I was just conservative as I wasn't super confident if the constraint written in the issue linked in my previous comment would be fixed clearly.

crcrpar avatar Sep 11 '24 13:09 crcrpar

@crcrpar @mpatel31415 So with a few more weeks, are we more confident?

t-vi avatar Sep 30 '24 08:09 t-vi