Ian Upright

Results 3 comments of Ian Upright

In K-Adapter, it seems like what they are doing is adding a number of Transformer layers to the end of the chain, which gets its input from a number of...

This issue should get reopened. I tried evaluating MMLU and got near 25% score with everything. Seems like something in the InContextLearningMultipleChoice stuff is completely broken, but not sure what...

I'm running MMLU. What I don't understand is this collate_fn in InContextLearningMultipleChoiceTaskDataset: for choice in choices: context_enc = preamble['input_ids'] + context['input_ids'] continuation_enc = choice['input_ids'] inp, continuation_span = _make_padded_input(context_enc, continuation_enc, self.max_seq_len,...