Jonathan Donnelly comments

Results 12 comments of


                                            Jonathan Donnelly

Training Regime and Backprop

Code to train this model 'from scratch' using data-parallelism across multiple GPUs: https://github.com/jonnykira/openai_reproduction

Can it work on chinese ? how can I train my chinese text dataset to use this?

hi @gitathrun. Are you using python 2 or 3?

Can it work on chinese ? how can I train my chinese text dataset to use this?

Cool, that should work then. For python 2 you would also have to convert the UTF-8 string to a bytearray object within preprocess(). Out of curiosity have you successfully trained...

Train over new data

Hello, Here is code to train a [multiplicative LSTM language model in Tensorflow](https://github.com/jonnykira/Tensorflow_mLSTM) Hope it works! please feel free to leave feedback!

Hello, sorry for the delayed response. I have achieved pretty good performance using a normal distribution for the initial weights. Here is a link to my [Tensorflow Implementation](https://github.com/jonnykira/Tensorflow_mLSTM)

ValueError: all the input arrays must have same number of dimensions

Hello @jozi ! Thank you for pointing this out. I have updated the extract_weights.py script to work with the train_mLSTM.py script as it is now. The Wmb variable was redundant...

Loading model from numpy weights

hello @athon-millane ! Thank you for pointing out this typo! and yes it should be relatively straight forward to initialize the variables in the training script with the pre-trained numpy...

sentiment neuron test

Hello, Sorry for the late reply! I have added a script called extract_weights.py to generate the .npy files in the format you want. All you need to do is pass...

sentiment neuron test

To find the hidden neuron I would try to recreate figure 3 from the paper [Learning to Generate Reviews and Discovering Sentiment](https://arxiv.org/abs/1704.01444) by feeding in the positive and negative IMDB...

Jonathan Donnelly

Training Regime and Backprop

Can it work on chinese ? how can I train my chinese text dataset to use this?

Can it work on chinese ? how can I train my chinese text dataset to use this?

Tips for training on GPU?

Train over new data

Weight Initialization

ValueError: all the input arrays must have same number of dimensions

Loading model from numpy weights

sentiment neuron test

sentiment neuron test