Min

Results 18 comments of Min

Hi, please change the dictionary size to 91612 and it should work. Thanks

Hi @deepakkumar1984 , multi-gpu training is not on my priority list, as SQuAD models usually train in a couple of hours with affordable gpus (gtx1070 etc). But I will definitely...

Hi @tuan3w , implementing ELMo is definitely one of the priorities. I will work on it as soon as I have some time. Thanks

Hi, Try re running the `config.py --mode prepro` if you make changes regarding embedding matrices

It's not that the number of words and chars are different. tf.assign happens when the pretrained weights are loaded. It seems like you are trying to load a trained model...

You can just train the new character embeddings from scratch while using the pretrained weights for other weights. But I don't have that feature implemented here. You might have to...

Hi @mikalyoung , I haven't tried parallelisation across multiple GPUs so I wouldn't know what the best way to go about it is. I heard that data parallelism is easier...

Hi @PANXiao1994 , we have tried putting the projection after the highway network. However, we found it to be overfitting severely and it decreased the performance. If anyone else finds...