I was looking through some of it yesterday and realized my `GESD` implementation was broken. The fixed one is in the repo now; try with that. It may give better...
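For reference, GESD is the geometric mean of a Euclidean-distance term and a sigmoid-dot-product term, as in the answer-selection papers. A minimal NumPy sketch of that formula (the `gamma` and `c` defaults here are just illustrative, not necessarily what the repo uses):

```python
import numpy as np

def gesd_similarity(x, y, gamma=1.0, c=1.0):
    # Geometric mean of Euclidean and Sigmoid Dot product:
    #   1 / (1 + ||x - y||)  *  1 / (1 + exp(-gamma * (x . y + c)))
    euclidean = 1.0 / (1.0 + np.linalg.norm(x - y))
    sigmoid_dot = 1.0 / (1.0 + np.exp(-gamma * (np.dot(x, y) + c)))
    return euclidean * sigmoid_dot
```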
I trained the attention model and printed out some predicted and expected answers, then dumped them in [this gist](https://gist.github.com/codekansas/9429ccfb1675da28f3186892180ba878). You guys can decide for yourself. I'm more or less ready...
I noticed the two scripts run for 2000000 (CNN) and 20000000 (LSTM+CNN) batches; it must have taken a really long time to train. The results I included were...
Wow, I did not realize the Teslas are so fast... I'll just run it for a while on my 980 Ti, I suppose. Character-level embeddings, though? It looks like regular...
I think the performance really depends on how long you run it. I ran a CNN-LSTM model for ~700 epochs and got a precision of 0.52; going to run it...
Ended up with

```
Best: Loss = 0.001460216869, Epoch = 879
2016-08-14 05:58:27 :: ----- test1 -----
[====================] Top-1 Precision: 0.564444 MRR: 0.680506
2016-08-14 06:17:06 :: ----- test2 -----
[====================] Top-1 Precision:...
```
17 days seems slow for that GPU? I wonder if it is slow for some reason; maybe it's running on the CPU instead of the GPU? But 3000 epochs \*...
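If you want to rule out the CPU-fallback possibility, a quick sanity check (this is just a sketch for a stock Keras install; adjust for whichever backend you're actually on):

```python
import keras.backend as K

# Rough check that the backend actually sees a GPU.
if K.backend() == 'tensorflow':
    from tensorflow.python.client import device_lib
    print([d.name for d in device_lib.list_local_devices()])  # expect a GPU device entry
elif K.backend() == 'theano':
    import theano
    print(theano.config.device)  # expect 'gpu*' or 'cuda*', not 'cpu'
```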
I fixed this just now. I think the output shape should just always be `(None, 1)`. The thing is, I don't think it made a difference. I think the `nan`...
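To be concrete about what I mean by the output shape: the score layer should emit one value per sample, so its shape is `(None, 1)`. A toy sketch with a `Lambda` (not the actual layer in the repo, just the shape idea):

```python
from keras.layers import Input, Lambda
import keras.backend as K

x = Input(shape=(100,))
# One score per sample in the batch, so the output shape is (None, 1).
score = Lambda(lambda t: K.sum(t, axis=-1, keepdims=True),
               output_shape=lambda s: (s[0], 1))(x)
```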
The names are the same; test 1 and test 2 should be the same as in the papers. The validation data is generated by splitting the training data.
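The split itself is nothing special; roughly this idea (the exact fraction and shuffling in the repo may differ):

```python
import random

# Stand-in for the real list of training question/answer pairs.
training_data = list(range(1000))

random.seed(42)
random.shuffle(training_data)

# Hold out roughly 10% of the training set for validation.
split = int(0.9 * len(training_data))
train_set, valid_set = training_data[:split], training_data[split:]
```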
I'm still trying things out. In `insurance_qa_eval.py` I loaded some pre-trained embeddings, but I haven't put the embeddings on GitHub yet. To generate them, I trained Gensim's Word2Vec model to...
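If you want to regenerate them in the meantime, the rough recipe looks like the sketch below (the hyperparameters and the output filename here are placeholders, not necessarily what `insurance_qa_eval.py` expects):

```python
import numpy as np
from gensim.models import Word2Vec

# Toy corpus standing in for the tokenized question/answer text.
sentences = [
    ["what", "does", "homeowners", "insurance", "cover"],
    ["how", "much", "does", "renters", "insurance", "cost"],
]

# Gensim 4.x parameter names; older releases use `size` instead of `vector_size`.
model = Word2Vec(sentences, vector_size=100, window=5, min_count=1, workers=4)

# Dump an embedding matrix keyed by the model's vocabulary order.
weights = np.vstack([model.wv[word] for word in model.wv.index_to_key])
np.save("word2vec_weights.npy", weights)  # placeholder filename
```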