GoogleConceptualCaptioning

beam_size == 1 for self-critical decoding?

Open sgondala opened this issue 4 years ago • 1 comments

The paper 'Self-critical Sequence Training for Image Captioning' says that the baseline is obtained by greedy (argmax) decoding, which is the same technique used at inference time.

If that's the case, shouldn't the beam size for inference always be 1? If we just choose the argmax at each step, there's only one possible way of forming a sentence, right?

sgondala avatar Mar 03 '20 20:03 sgondala

There is no required connection between the baseline and the inference method. They can be different.

SCST uses greedy decoding at inference time because beam search doesn't improve performance much.

ruotianluo avatar Mar 03 '20 20:03 ruotianluo
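
To make the distinction in this thread concrete, here is a minimal sketch in plain Python. It is not code from this repository or from the paper: the toy transition table `TOY_PROBS` and the helpers `next_log_probs`, `beam_search`, and `scst_loss` are all hypothetical names invented for illustration. It shows two things: the greedy baseline only enters the training loss as a reward to subtract, and beam search with `beam_size == 1` is exactly greedy decoding, while a larger beam can return a different sentence from the same model.

```python
import math

# Hypothetical toy "model": the next-token distribution depends only on the
# previous token. Values are chosen so the greedy path is not the most
# probable complete sequence.
TOY_PROBS = {
    "<bos>": {"a": 0.6, "b": 0.4},
    "a": {"x": 0.55, "y": 0.45},
    "b": {"x": 0.9, "y": 0.1},
    "x": {"<eos>": 1.0},
    "y": {"<eos>": 1.0},
}


def next_log_probs(prefix):
    """Return log-probabilities of each possible next token."""
    return {tok: math.log(p) for tok, p in TOY_PROBS[prefix[-1]].items()}


def beam_search(beam_size, max_len=5):
    """Toy beam search; beam_size=1 reduces to greedy argmax decoding."""
    beams = [(0.0, ["<bos>"])]
    finished = []
    for _ in range(max_len):
        candidates = [
            (score + lp, seq + [tok])
            for score, seq in beams
            for tok, lp in next_log_probs(seq).items()
        ]
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, seq in candidates[:beam_size]:
            (finished if seq[-1] == "<eos>" else beams).append((score, seq))
        if not beams:
            break
    best_score, best_seq = max(finished, key=lambda c: c[0])
    return best_seq[1:-1], best_score  # strip <bos> and <eos>


def scst_loss(sample_log_prob, sample_reward, greedy_reward):
    """SCST-style loss for one sampled caption.

    The greedy caption's reward is only a baseline subtracted from the
    sampled caption's reward; it does not constrain how captions are
    decoded at test time.
    """
    return -(sample_reward - greedy_reward) * sample_log_prob


print(beam_search(1)[0])  # ['a', 'x']  (same as greedy)
print(beam_search(2)[0])  # ['b', 'x']  (higher total probability)
```

In this toy setup the baseline reward passed to `scst_loss` would come from the `beam_search(1)` caption during training, while inference is free to call `beam_search` with any beam size.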