Ian Upright
In K-Adapter, it seems like what they are doing is adding a number of Transformer layers to the end of the chain, which get their input from a number of...
This issue should get reopened. I tried evaluating MMLU and got a score near 25% with everything, which is what random guessing gives on four-choice questions. Seems like something in the InContextLearningMultipleChoice stuff is completely broken, but I'm not sure what...
I'm running MMLU. What I don't understand is this collate_fn in InContextLearningMultipleChoiceTaskDataset:

    for choice in choices:
        context_enc = preamble['input_ids'] + context['input_ids']
        continuation_enc = choice['input_ids']
        inp, continuation_span = _make_padded_input(context_enc, continuation_enc, self.max_seq_len,...
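
To make the question concrete, here is a rough sketch of what I would expect a helper like that to do: join the context and one choice, pad to a fixed length, and report which positions hold the choice tokens. This is only my assumption about the intent, not the library's actual `_make_padded_input`. The name `make_padded_input_sketch`, the `pad_tok_id` parameter, the left-truncation rule, and the toy token ids are all hypothetical.

```python
from typing import List, Tuple


def make_padded_input_sketch(
    context_enc: List[int],
    continuation_enc: List[int],
    max_seq_len: int,
    pad_tok_id: int,
) -> Tuple[List[int], List[int]]:
    """Hypothetical stand-in, NOT the library's _make_padded_input."""
    inp = context_enc + continuation_enc

    # Assumption: if the sequence is too long, drop tokens from the left
    # so the continuation (the answer choice) is kept.
    overflow = max(0, len(inp) - max_seq_len)
    inp = inp[overflow:]

    # Positions of the continuation tokens inside the (unpadded) input.
    kept_context = max(0, len(context_enc) - overflow)
    continuation_span = list(range(kept_context, len(inp)))

    # Right-pad to the fixed length.
    inp = inp + [pad_tok_id] * (max_seq_len - len(inp))
    return inp, continuation_span


# Toy token ids standing in for tokenizer output.
preamble = {"input_ids": [1, 2]}
context = {"input_ids": [3, 4, 5]}
choices = [{"input_ids": [6]}, {"input_ids": [7, 8]}]

rows = []
for choice in choices:
    context_enc = preamble["input_ids"] + context["input_ids"]
    continuation_enc = choice["input_ids"]
    rows.append(
        make_padded_input_sketch(context_enc, continuation_enc, max_seq_len=8, pad_tok_id=0)
    )
```

If the real code behaves like this, each choice gets its own row with the same context, and only the positions in `continuation_span` should count when comparing choices. A span that is off by one, or that points at padding, would be one way to end up at chance level, though I have not confirmed that is what happens here.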