
Stricter loading of checkpoints

Open carmocca opened this issue 2 years ago • 0 comments

Strict loading is useful because it enforces that the keys in your checkpoint match the model's exactly, so the checkpoint gets loaded as you expect.

In the case of the fine-tuned checkpoints, we can merge them into the pre-trained checkpoint before loading, so that strict=True can be used.
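A minimal sketch of the merge-then-load idea, using plain dictionaries in place of real state dicts so it runs without any dependencies. The helper names (`merge_checkpoints`, `check_keys_strict`) and the parameter names are hypothetical and are not taken from the repository; `check_keys_strict` only mimics the key check that a strict load performs.

```python
def merge_checkpoints(pretrained, finetuned):
    """Return a new mapping where fine-tuned entries extend or override the pre-trained ones."""
    merged = dict(pretrained)  # copy so the pre-trained mapping is left untouched
    merged.update(finetuned)
    return merged


def check_keys_strict(expected_keys, state_dict):
    """Mimic the key check of a strict load: raise on missing or unexpected keys."""
    expected = set(expected_keys)
    provided = set(state_dict)
    missing = sorted(expected - provided)
    unexpected = sorted(provided - expected)
    if missing or unexpected:
        raise KeyError(f"missing keys: {missing}, unexpected keys: {unexpected}")


# Hypothetical parameter names, for illustration only.
model_keys = ["wte.weight", "attn.weight", "attn.adapter_wte.weight"]
pretrained = {"wte.weight": [0.1, 0.2], "attn.weight": [0.3, 0.4]}
finetuned = {"attn.adapter_wte.weight": [0.5, 0.6]}  # holds only the fine-tuned weights

merged = merge_checkpoints(pretrained, finetuned)
check_keys_strict(model_keys, merged)  # passes: every expected key is present, none extra
```

Checking `finetuned` on its own would fail the strict check because the pre-trained keys are missing, which is the situation that otherwise forces strict=False.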

Additionally, we only set strict=False if quantization is used. This is because of https://github.com/Lightning-AI/lit-parrot/pull/72#issuecomment-1567317405

carmocca, Jun 14 '23 16:06