ATTEMPT
ATTEMPT copied to clipboard
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
Thank you for the greak work! I wonder if you can provide the exact numbers in different model sizes of each method in Figure 4? Looking forward to hearing from...
the json file (attempt/configs/attempt/eval_glue.json) may forget to add the dataset stsb and cola in the paper?And the dataset wnli which is not in the paper and I think that the...
Thanks for sharing your wonderful work! I wonder about the few-shot learning setting of ATTEMPT. Did you run a limited number of gradient descent for each datapoint (e.g. 10 GD...
In the multi-task target traing, the file "multi_task_superglue.json" should set the param "shared_attn" true, otherwise the train dataset didn't have the task_ids.