CALM-pytorch Possible to load huggingface's pretrained models in anchor_llm & augment

Possible to load huggingface's pretrained models in anchor_llm & augment_llm?

Open prashantkodali opened this issue 1 year ago • 5 comments

In the code-snippet below, is it possible to load Decoder/Encoder with pre-trained models from huggingface hub?

augment_llm = TransformerWrapper(
    num_tokens = 20000,
    max_seq_len = 1024,
    attn_layers = Decoder(
        dim = 512,
        depth = 12,
        heads = 8
    )
)

anchor_llm = TransformerWrapper(
    num_tokens = 20000,
    max_seq_len = 1024,
    attn_layers = Decoder(
        dim = 512,
        depth = 2,
        heads = 8
    )
)