The Natural Language Decathlon: Multitask Learning as Question Answering
Metadata
- Authors: Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher
- Organization: Salesforce Research
- Publish Date: 2018.06
- Paper: https://arxiv.org/pdf/1806.08730.pdf
- Code: https://github.com/salesforce/decaNLP
- Blog: https://einstein.ai/research/blog/the-natural-language-decathlon
- Video: https://www.youtube.com/watch?v=MENYCdm1eis
- Website: http://decanlp.com/
Summary
- This paper presents a Multitask Question Answering Network (MQAN) that jointly learns ten different NLP tasks by casting all of them as question answering (see the illustrative triples after this list).
- The model encodes with dual coattention and multi-head self-attention, and decodes with a pointer-generator-style mechanism that either copies words from the context or the question, or generates words from an external vocabulary; no explicit supervision is needed to decide which mechanism to use (a sketch follows the list).
- Anti-curriculum learning (training on the hard tasks first) clearly outperforms curriculum learning, which hurts performance.
- The model can perform zero-shot classification: an unseen task is posed as a question, and the unseen class labels can be copied directly from that question (a form of meta-learning).
- Model and training details are reported.
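For concreteness, here is a minimal sketch of how different tasks map onto the shared (question, context, answer) format. The question wordings and examples below are illustrative approximations, not necessarily the paper's exact templates.

```python
# Illustrative (question, context, answer) triples showing how decaNLP casts
# different tasks as question answering. Question wordings are approximations.
examples = [
    {   # question answering (SQuAD): the question is the natural question itself
        "question": "What causes precipitation to fall?",
        "context":  "... precipitation falls under gravity ...",
        "answer":   "gravity",
    },
    {   # machine translation (IWSLT En->De): the question names the task
        "question": "What is the translation from English to German?",
        "context":  "The house is small.",
        "answer":   "Das Haus ist klein.",
    },
    {   # sentiment classification (SST): class labels appear inside the question,
        # so the decoder can copy them, which is what enables zero-shot transfer
        # to new tasks and new label sets
        "question": "Is this review positive or negative?",
        "context":  "The acting was wooden but the score was lovely.",
        "answer":   "negative",
    },
]
```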
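And a minimal PyTorch sketch of the pointer-generator-style output distribution described above: a vocabulary distribution is mixed with copy distributions over the context and the question, with scalar gates (called `gamma` and `lam` here; our names, not the paper's notation) predicted from the decoder state, so no extra supervision decides which source to use.

```python
import torch
import torch.nn.functional as F

def mix_output_distribution(vocab_logits, context_attn, question_attn,
                            context_ids, question_ids, gamma, lam, vocab_size):
    """Sketch of a pointer-generator-style mixture over three sources:
    generating from the vocabulary, copying from the context, and copying
    from the question. `gamma` and `lam` are gates in [0, 1] that would be
    predicted from the decoder state at each step."""
    p_vocab = F.softmax(vocab_logits, dim=-1)  # (vocab_size,)

    # Scatter attention weights onto vocabulary positions to form copy distributions.
    p_context = torch.zeros(vocab_size).scatter_add(0, context_ids, context_attn)
    p_question = torch.zeros(vocab_size).scatter_add(0, question_ids, question_attn)

    # Final distribution: a convex combination of the three sources.
    return gamma * p_vocab + (1 - gamma) * (lam * p_context + (1 - lam) * p_question)

# Toy usage with made-up ids and attention weights; the result is a valid distribution.
vocab_size = 50_000
p = mix_output_distribution(
    vocab_logits=torch.randn(vocab_size),
    context_attn=torch.tensor([0.7, 0.3]),
    question_attn=torch.tensor([0.9, 0.1]),
    context_ids=torch.tensor([101, 2040]),
    question_ids=torch.tensor([1999, 3007]),
    gamma=0.5, lam=0.6, vocab_size=vocab_size,
)
assert torch.isclose(p.sum(), torch.tensor(1.0))
```

Because class names such as "positive" and "negative" appear in the question itself, the question-copy term is what lets the decoder produce unseen labels, which underlies the zero-shot behavior noted above.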