pytorch-image-models
pytorch-image-models copied to clipboard
Resuming compiled model checkpoint
Sometimes im getting Missing key(s) in state_dict: "stem.conv1.c.weight", [...] Unexpected key(s) in state_dict: "_orig_mod.stem.conv1.c.weight", [...]
while trying to resume training from checkpoint of compiled (--torchcompile=inductor
) model.
This PR solves the issue by stripping _orig_mod.
prefix before loading weights.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
@mjamroz does this cover the case of a DDP wrapped torchcompile model? I don't know if i've actually checked that ... is it _orig_mod.module. or just _orig_mod?