Yufeng Ma
I think this should be due to a PyTorch version change. At the least, we have a matrix of 30 × review_len for the `weights` variable for each review. This is the weight...
Try this:

```python
vector = weights.sum(0)
vector = vector / vector.sum()
att, ids_to_show = vector.sort(0, descending=True)
```

It sums all 30 attention weight vectors and normalizes the result.
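For context, a complete toy version of that aggregation might look like the following; the `weights` shape and the `tokens` list are stand-ins, not values taken from the repo:

```python
import torch

# Hypothetical inputs: `weights` is the (30, review_len) attention matrix for
# one review, `tokens` is the list of review_len word strings.
weights = torch.rand(30, 120)          # stand-in for the model's attention output
tokens = ['tok%d' % i for i in range(120)]

vector = weights.sum(0)                # collapse the 30 attention hops into one
vector = vector / vector.sum()         # normalize so the weights sum to 1
att, ids_to_show = vector.sort(0, descending=True)

# Show the 10 most-attended tokens with their combined weights.
for w, idx in zip(att[:10], ids_to_show[:10]):
    print('%-15s %.4f' % (tokens[idx.item()], w.item()))
```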
I'm sorry that I didn't extract the features in this version. The `fc_features` is actually the MLP hidden input to the final classifier. You may just extract it from https://github.com/yufengm/SelfAttentive/blob/406a1f2d5e62eebc7a4b995b68114fb4ea87f98f/model.py#L91...
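If it helps, here is a minimal sketch of capturing that hidden input with a forward hook; the toy model and the `pred` attribute name are placeholders for the actual classifier layer around model.py#L91, not the repo's real module names:

```python
import torch
import torch.nn as nn

# Toy stand-in for the self-attentive classifier; `pred` plays the role of the
# final classifier whose *input* is the fc_features / MLP hidden vector.
class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.mlp = nn.Linear(64, 32)
        self.pred = nn.Linear(32, 5)

    def forward(self, x):
        return self.pred(torch.relu(self.mlp(x)))

model = ToyModel()
features = {}

def save_input(module, inputs, output):
    # The input to the final classifier is the MLP hidden representation.
    features['fc_features'] = inputs[0].detach()

handle = model.pred.register_forward_hook(save_input)
with torch.no_grad():
    model(torch.randn(4, 64))          # any normal forward pass
handle.remove()

print(features['fc_features'].shape)   # torch.Size([4, 32])
```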
I'm sorry, but I have only implemented it in PyTorch.
Did you validate or test on Flickr? I haven't looked at results from the early epochs; typically it converges only after at least 20 epochs.
Because I didn't implement beam search, there is still about a 2-point margin.
Please refer to https://github.com/yufengm/Adaptive/blob/4c0555af546cdbd49e99ff1bd6e91d1654ae0cd2/train.py#L152 for testing on the validation dataset.
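For reference, such a per-epoch validation pass usually has roughly this shape; `val_loader`, `decode_fn`, and `score_fn` are placeholder names for illustration, not the repo's actual API:

```python
import torch

# Rough sketch of a per-epoch validation pass over a captioning model.
def validate(model, val_loader, decode_fn, score_fn, device='cpu'):
    model.eval()
    hypotheses, references = [], []
    with torch.no_grad():
        for images, captions in val_loader:
            # Greedy decoding only, since beam search is not implemented.
            hypotheses.extend(decode_fn(model, images.to(device)))
            references.extend(captions)
    model.train()
    return score_fn(hypotheses, references)   # e.g. BLEU-4 / CIDEr on the split
```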
Not really. I haven't implemented the beam search part of the paper.
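For anyone who wants to close that gap, a generic beam search step looks roughly like the sketch below; the `step_fn(tokens, state)` interface returning `(logits, new_state)` is an assumption for illustration, not something the repo provides:

```python
import torch
import torch.nn.functional as F

# Generic beam search sketch: keep the `beam_size` highest-scoring partial
# captions, expanding each by its top `beam_size` next tokens every step.
def beam_search(step_fn, init_state, bos_id, eos_id, beam_size=3, max_len=20):
    beams = [([bos_id], init_state, 0.0)]            # (tokens, state, log prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for tokens, state, score in beams:
            logits, new_state = step_fn(tokens, state)   # assumed interface
            logp = F.log_softmax(logits, dim=-1)
            topv, topi = logp.topk(beam_size)
            for v, i in zip(topv.tolist(), topi.tolist()):
                candidates.append((tokens + [i], new_state, score + v))
        candidates.sort(key=lambda c: c[2], reverse=True)
        beams = []
        for tokens, state, score in candidates[:beam_size]:
            # Move beams that emitted <eos> into the finished pool.
            (finished if tokens[-1] == eos_id else beams).append((tokens, state, score))
        if not beams:
            break
    finished.extend(beams)
    return max(finished, key=lambda c: c[2])[0]      # best-scoring token ids
```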
Thanks for your note. I've also spotted this before. But the preprocessing just follows Karpathy's steps, in which he simply applied the NLTK tokenizer. Hope this helps.
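Concretely, the tokenization in question is roughly just the following; this is a small illustration assuming Karpathy-style lowercasing before NLTK's tokenizer, not the repo's exact preprocessing script:

```python
import nltk

# One-time download of the tokenizer models NLTK needs.
nltk.download('punkt', quiet=True)

caption = "A man riding a wave on top of a surfboard."
tokens = nltk.word_tokenize(caption.lower())
print(tokens)
# ['a', 'man', 'riding', 'a', 'wave', 'on', 'top', 'of', 'a', 'surfboard', '.']
```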