CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", from Google DeepMind
In the code snippet below, is it possible to load the Decoder/Encoder with pre-trained models from the Hugging Face hub?

```python
augment_llm = TransformerWrapper(
    num_tokens = 20000,
    max_seq_len = 1024,
    attn_layers = Decoder(
        dim...
```
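A hedged sketch of one possible answer: x-transformers' `TransformerWrapper` builds its own architecture and weights, so a Hugging Face checkpoint cannot be loaded into it directly. One option (an assumption, not from this repo) is an adapter that hides a pretrained model behind the interface CALM needs, i.e. token ids in, per-layer hidden states out. `HFAdapter`, the `TinyStub` demo model, and the `output_hidden_states` convention below are illustrative; only the last follows the real `transformers` API.

```python
import torch
import torch.nn as nn
from types import SimpleNamespace

class HFAdapter(nn.Module):
    """Hypothetical adapter: wraps a transformers-style model and
    returns its tuple of per-layer hidden states."""
    def __init__(self, hf_model):
        super().__init__()
        self.model = hf_model

    def forward(self, token_ids):
        # transformers models expose per-layer activations when asked
        out = self.model(input_ids=token_ids, output_hidden_states=True)
        return out.hidden_states  # tuple: one (b, n, d) tensor per layer

# Stand-in for a real pretrained model, so the sketch runs offline.
class TinyStub(nn.Module):
    def forward(self, input_ids, output_hidden_states=False):
        b, n = input_ids.shape
        h = torch.zeros(b, n, 4)
        return SimpleNamespace(hidden_states=(h, h))

# In practice one would instead do (downloads weights, so not run here):
# from transformers import AutoModel
# augment_llm = HFAdapter(AutoModel.from_pretrained("gpt2"))
augment_llm = HFAdapter(TinyStub())
hiddens = augment_llm(torch.zeros(1, 3, dtype=torch.long))
```
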
What is the model size (number of trainable parameters) of the following models used for the experiments in the paper?
1. PaLM2-XXS
2. PaLM2-XS
3. PaLM2-S
When training, memory usage continuously increases during the loss calculation in the following part:

```python
loss = F.cross_entropy(
    rearrange(logits, 'b n c -> b c n'),...
```