ImageCaptioning.pytorch icon indicating copy to clipboard operation
ImageCaptioning.pytorch copied to clipboard

why don't you add <start> and <end> to the encoding ground truth?

Open tuyunbin opened this issue 6 years ago • 2 comments

Firstly, thank you very much for your very useful Repo. However, I have a question that why don't you add 'start' and 'end' to the encoding ground truth like other methods? So, does the model konw how to end the decode phase?

tuyunbin avatar Jun 17 '19 12:06 tuyunbin

0 is.

ruotianluo avatar Jun 17 '19 14:06 ruotianluo

Thank you for your prompt reply! And I have another question that you said the resulting files are about 200GB, but the feats_att.h5 I extracted via your code is 396.0 GB (395,951,570,760 bytes). Why?

tuyunbin avatar Jun 19 '19 04:06 tuyunbin