[QST] What is the proper way to load a checkpoint into merlin.models.torch.DLRMModel?
❓ Questions & Help
DLRMModel cannot load a trained checkpoint properly.
Details
The torch implementation of DLRMModel has an interaction layer with a member that is created via register_buffer inside a conditional block (see https://github.com/NVIDIA-Merlin/models/blob/eb1e54196a64a70950b2a7e7744d2150e052d53e/merlin/models/torch/blocks/dlrm.py#L67). A newly created interaction layer has no such member until forward is called, so the checkpoint cannot be loaded properly. What is the recommended way to load it?
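
For anyone trying to reproduce this without Merlin installed, here is a minimal sketch of the same pattern in plain PyTorch. `LazyInteraction` is a hypothetical stand-in I wrote for illustration, not the actual Merlin class, and the buffer name `triu_indices` and the tensor shapes are assumptions. It shows the strict load failing on a fresh module, plus two generic PyTorch workarounds (a warm-up forward pass before loading, or `strict=False`). These are not a confirmed recommendation from the maintainers.

```python
import torch
from torch import nn


class LazyInteraction(nn.Module):
    """Hypothetical stand-in: registers a buffer on the first forward call."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, num_fields, dim).
        if not hasattr(self, "triu_indices"):
            n = x.shape[1]
            # The buffer only exists after this line has run once.
            self.register_buffer("triu_indices", torch.triu_indices(n, n, offset=1))
        scores = torch.bmm(x, x.transpose(1, 2))
        return scores[:, self.triu_indices[0], self.triu_indices[1]]


# "Trained" module: one forward call creates the buffer, so it is saved.
trained = LazyInteraction()
trained(torch.randn(2, 4, 8))
state = trained.state_dict()

# Fresh module: the buffer is absent, so a strict load rejects the extra key.
fresh = LazyInteraction()
try:
    fresh.load_state_dict(state)
    strict_failed = False
except RuntimeError:
    strict_failed = True

# Workaround A (assumption): run a dummy forward with the same number of
# fields so the buffer exists, then load strictly.
warmed = LazyInteraction()
with torch.no_grad():
    warmed(torch.zeros(1, 4, 8))
warmed.load_state_dict(state)

# Workaround B (assumption): load non-strictly and let the first real
# forward call recreate the buffer.
relaxed = LazyInteraction()
result = relaxed.load_state_dict(state, strict=False)
```

Workaround B only looks safe if the lazily created buffer holds values that can be recomputed from the input shape, as in this sketch. Whether that holds for every buffer in DLRMModel is something the maintainers would need to confirm.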