FasterTransformer
decoupled mode not working when beam_width > 1
I am running a T5 decoder model with FasterTransformer. When I set beam_width > 1 in decoupled mode, the tokens that are streamed back are garbage. Is this expected?