snli-entailment
The alpha value is not as expected!
Hi @shyamupa ,
Thanks for your attention model!!
I was able to extract the alpha values to visualize the model's attention for my task.
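For reference, this is roughly how I pull out the intermediate outputs (a minimal sketch; `model` and `x` are placeholders for the trained model and a padded input batch, and it assumes the layers can be looked up by the names used in this repo):

```python
from keras import backend as K

# Hypothetical probe: fetch the pre-softmax and post-softmax attention
# outputs in one pass. `model` and `x` are assumed to already exist.
get_alphas = K.function(
    [model.input, K.learning_phase()],
    [model.get_layer('flat_alpha').output,
     model.get_layer('alpha').output])

flat_alpha, alpha = get_alphas([x, 0])  # 0 = test phase
```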
But I found a strange phenomenon with the alpha values.
The following picture is the heatmap of the "flat_alpha" layer output:
It looks good!!!
But when I exported the output of the "alpha" layer (after the softmax), I got the following result:
I know softmax will sharpen and normalize the values, but when I applied a softmax to the flat_alpha data locally, the result (below) was different from the output of the "alpha" layer:
The heatmap shape is (20, 200): there are 20 sentences, and every sentence has length 200.
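For concreteness, this is the local computation I mean (a minimal NumPy sketch; the axis choice is my assumption, since a mismatch like this often comes from normalizing over the wrong axis, or from masking/padding applied inside the graph but not locally):

```python
import numpy as np

def softmax(x, axis=-1):
    # subtract the max for numerical stability
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# flat_alpha is the exported (20, 200) array; one distribution per sentence
local_alpha = softmax(flat_alpha, axis=1)  # normalize over the 200 timesteps
# If local_alpha differs from the exported "alpha" layer, check which axis
# the in-graph softmax normalizes over and whether padded positions are masked.
```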
Do you have any suggestions about this?
Thanks for your attention model again. I agree with @chungfu27. The paper describes the model as computing "word-by-word attention based on all output vectors of the hypothesis (h7, h8 and h9)", but in the code, I think the attention is computed based only on the last output vector.
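To make the difference concrete, here is a rough NumPy sketch of the two variants (the names, shapes, and the simplified word-by-word recurrence are my own reading of Rocktäschel et al. (2016), not the repo's code):

```python
import numpy as np

def softmax(x):
    # stable softmax over a 1-D score vector
    e = np.exp(x - x.max())
    return e / e.sum()

def last_vector_attention(Y, h_N, W_y, W_h, w):
    """Attention conditioned only on the final hypothesis vector h_N,
    which is what the code appears to implement.
    Y: premise outputs (L, k); h_N: last hypothesis output (k,)."""
    M = np.tanh(Y @ W_y + h_N @ W_h)  # (L, k)
    alpha = softmax(M @ w)            # (L,) one distribution over the premise
    return Y.T @ alpha                # attended premise vector r

def word_by_word_attention(Y, H, W_y, W_h, W_r, w):
    """One attention distribution per hypothesis step h_t, feeding the
    previous attended vector r back in (simplified from the paper).
    H: all hypothesis outputs (N, k)."""
    r = np.zeros(Y.shape[1])
    for h_t in H:                     # iterate over every hypothesis output
        M = np.tanh(Y @ W_y + h_t @ W_h + r @ W_r)  # (L, k)
        alpha = softmax(M @ w)
        r = Y.T @ alpha
    return r
```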