neuralconvo
Limit the vocabulary size using word frequencies + remove LR decay
Hi @macournoyer,
This is a small PR, heavily inspired by the code changes @chenb67 made in her fork and by the discussions with you.
- Commented out LR decay (Adam should take care of this)
- Limited the vocabulary size based on word frequencies
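To illustrate the second change, here is a minimal sketch of frequency-based vocabulary limiting. It is written in Python rather than the repo's Lua and is not the code in this PR: the function names `build_vocab` and `encode` and the `<unknown>` placeholder token are assumptions made for the example. The idea is to count word occurrences, keep only the most frequent words up to the limit, and map everything else to a single unknown token.

```python
from collections import Counter

# Hypothetical placeholder for out-of-vocabulary words (an assumption for
# this sketch, not necessarily the token used in the repo).
UNKNOWN = "<unknown>"


def build_vocab(sentences, max_vocab_size):
    """Map the max_vocab_size most frequent words to integer ids.

    sentences is an iterable of tokenized sentences (lists of words).
    Id 0 is reserved for the unknown token.
    """
    counts = Counter(word for sentence in sentences for word in sentence)
    word_to_id = {UNKNOWN: 0}
    # most_common() returns words sorted by descending frequency.
    for word, _ in counts.most_common(max_vocab_size):
        if word not in word_to_id:
            word_to_id[word] = len(word_to_id)
    return word_to_id


def encode(sentence, word_to_id):
    """Replace each word by its id, falling back to the unknown id."""
    unknown_id = word_to_id[UNKNOWN]
    return [word_to_id.get(word, unknown_id) for word in sentence]


corpus = [["hi", "there"], ["hi", "you"], ["hi", "there", "friend"]]
vocab = build_vocab(corpus, max_vocab_size=2)
ids = encode(["hi", "you", "there"], vocab)
```

With a limit of 2, only `hi` and `there` are kept, so `you` is encoded as the unknown id. A smaller vocabulary shrinks the output layer, at the cost of rare words collapsing into one token.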
Training with these changes is in progress on a small dataset. I will update the results soon.
I have tried to keep the code changes similar to @chenb67's, so that we do not face too many conflicts while merging.
-- Update -- Since the dataset and vocabulary were small, some of the responses are not as good as before, but overall the results are satisfactory! The perplexity kept decreasing with each epoch and came down to around 2.76 after 50 epochs.
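For readers interpreting the perplexity figure, the sketch below shows the usual definition: the exponential of the mean negative log-likelihood per token. This assumes the training script reports perplexity this way, which the PR does not state; the function name `perplexity` and its input format are hypothetical.

```python
import math


def perplexity(token_log_probs):
    """Perplexity from per-token natural-log probabilities.

    Computes exp(-(1/N) * sum(log p_i)), i.e. the exponential of the
    mean negative log-likelihood.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# A model that assigns probability 0.5 to every target token
# has a perplexity of 2.
example = perplexity([math.log(0.5)] * 4)
```

Under this definition, a perplexity of about 2.76 corresponds to a mean negative log-likelihood of roughly 1.02 nats per token.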
Results: th train.lua --cuda --dataset 25000 --hiddenSize 1000 --maxVocabSize 10000
you> Hi!
neuralconvo> Yes?
you> Hello?
neuralconvo> Hey.
you> How are you?
neuralconvo> I need to know you, hildy.
you> What's your name?
neuralconvo> Smith.
you> How so?
neuralconvo> He was uptown when i