CMKP
About calculating the vocabulary distribution
In your paper, the input used to compute the word distribution over the vocabulary is a context-rich representation c_t = [u_t; s_t; c_text + c_fuse]. But in your code, it seems that you only concatenate s_t and c_text + c_fuse. Is this a typo in the paper, or is there something I didn't notice?
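To make the discrepancy concrete, here is a minimal sketch of the two variants I mean. This is not the repository's code: the dimensions, the random vectors, and the helper names (`linear`, `softmax`, `vocab_dist`) are all made up for illustration, and the projection is just an assumed single linear layer followed by a softmax.

```python
import math
import random


def linear(x, weight, bias):
    """Plain matrix-vector product plus bias: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def vocab_dist(parts, vocab_size, rng):
    """Concatenate the given vectors, then project to a vocabulary distribution."""
    x = [v for part in parts for v in part]  # [a; b; ...] concatenation
    weight = [[rng.uniform(-0.1, 0.1) for _ in x] for _ in range(vocab_size)]
    bias = [0.0] * vocab_size
    return x, softmax(linear(x, weight, bias))


rng = random.Random(0)
emb_dim, hid_dim, vocab_size = 4, 6, 10  # hypothetical sizes

# Random stand-ins for the quantities named in the paper.
u_t = [rng.random() for _ in range(emb_dim)]
s_t = [rng.random() for _ in range(hid_dim)]
c_text = [rng.random() for _ in range(hid_dim)]
c_fuse = [rng.random() for _ in range(hid_dim)]
c_sum = [a + b for a, b in zip(c_text, c_fuse)]  # c_text + c_fuse

# Variant described in the paper: [u_t; s_t; c_text + c_fuse]
paper_input, paper_dist = vocab_dist([u_t, s_t, c_sum], vocab_size, rng)

# Variant I seem to see in the code: [s_t; c_text + c_fuse]
code_input, code_dist = vocab_dist([s_t, c_sum], vocab_size, rng)

print(len(paper_input), len(code_input))
```

With these made-up sizes, the two inputs differ in length by exactly `emb_dim`, so the projection layer would need a different input dimension depending on which variant is intended.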