whisper repeats audio_features tensor, just like tokens tensor, by group size…

repeats audio_features tensor, just like tokens tensor, by group size…

Open Majdoddin opened this issue 1 year ago • 0 comments

… for beam search or best-of-n sampling.

If run with batch of size 1, the current code does not raise an error, because of Pytorch's Broadcasting. But by bigger batch sizes it raises error.

Oct 14 '24 17:10 Majdoddin