LSTNet
LSTNet copied to clipboard
Towards Local Visual Modeling for Image Captioning
论文里列出了transformer的cider分数131左右,最近我也在用这套代码训练3enc+3dec的纯transformer模型,cider分数只能达到127左右,我看到你代码中python和pytorch版本都非常的低,请问最终表现和版本相关性大吗?谢谢。
Could you please share the code of the attention map in the Visualization? Thank you
Nice Job! We can not find the extracted features for Flickr30k dataset from the link you provided, could you provide it? Thanks!
# Patching CVE-2007-4559 Hi, we are security researchers from the Advanced Research Center at [Trellix](https://www.trellix.com). We have began a campaign to patch a widespread bug named CVE-2007-4559. CVE-2007-4559 is a...