deep_qa

Figure out a good way to handle increasing the vocabulary of a pre-trained model

Open matt-gardner opened this issue 7 years ago • 0 comments

Say you train a model on SQuAD, then want to fine-tune it on SciQ. Presumably there will be words in SciQ that you have plenty of training data for, but that were OOV in SQuAD. How do you handle updating the vocabulary in this setting? This is hard, because you basically need to append new rows to an existing embedding matrix, which changes the shape of a weight that an already-built computation graph depends on. Not sure at all how to do this, but it'd be pretty nice to have.
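
To make the "append new rows" part concrete, here is a minimal sketch of one possible approach, not something deep_qa is known to implement. It assumes the vocabulary is a plain word-to-index dict and that the trained embedding weights can be pulled out as a list of rows. The function name `extend_vocabulary`, the `<pad>` / `<unk>` tokens, the `scale` and `seed` parameters, and the toy data are all hypothetical, chosen only for illustration.

```python
import random


def extend_vocabulary(vocab, embeddings, new_words, scale=0.1, seed=0):
    """Return (new_vocab, new_embeddings) with rows appended for unseen words.

    vocab:      dict mapping word -> row index into `embeddings`
    embeddings: list of rows (each a list of floats of equal length)
    new_words:  iterable of words from the fine-tuning dataset
    """
    rng = random.Random(seed)
    dim = len(embeddings[0])

    # Copy so the pre-trained vocabulary and weights are left untouched.
    new_vocab = dict(vocab)
    new_embeddings = [list(row) for row in embeddings]

    for word in new_words:
        if word in new_vocab:
            continue  # already has a trained row; keep its index
        # New words get the next free index, so old indices stay valid.
        new_vocab[word] = len(new_embeddings)
        # Small random initialization (an assumption; other schemes are possible).
        new_embeddings.append([rng.uniform(-scale, scale) for _ in range(dim)])

    return new_vocab, new_embeddings


# Toy stand-ins for a vocabulary and embedding matrix trained on SQuAD.
squad_vocab = {"<pad>": 0, "<unk>": 1, "the": 2, "river": 3}
squad_embeddings = [[0.0, 0.0], [0.1, -0.1], [0.3, 0.2], [-0.4, 0.5]]

# Words seen in the fine-tuning data (SciQ in the example above).
sciq_words = ["river", "mitochondria", "photosynthesis"]

sciq_vocab, sciq_embeddings = extend_vocabulary(
    squad_vocab, squad_embeddings, sciq_words
)
```

Because existing indices never change, the trained rows keep their meaning. One way to sidestep modifying the existing graph would be to rebuild the model with the larger embedding layer and load the extended matrix as its initial weights before fine-tuning, though whether that fits a given framework's save/load machinery is a separate question.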

matt-gardner, Apr 20 '17 18:04