whisper
Repeat the audio_features tensor, just like the tokens tensor, by the group size for beam search or best-of-n sampling.
With a batch size of 1, the current code does not raise an error, because PyTorch's broadcasting silently expands the single audio_features entry to match the repeated tokens. With larger batch sizes, it raises an error.
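To make the failure mode concrete, here is a minimal, dependency-free sketch of the shape rule involved. It does not use PyTorch or the whisper code itself: `broadcast_batch` and `repeat_interleave` are hypothetical stand-ins that only mimic how a batch dimension is broadcast and how each batch entry would be repeated by the group size, and the batch and group sizes are made-up values.

```python
def broadcast_batch(a: int, b: int) -> int:
    """Broadcast two batch sizes: they must be equal, or one of them must be 1."""
    if a == b or a == 1 or b == 1:
        return max(a, b)
    raise ValueError(f"batch sizes {a} and {b} cannot be broadcast")


def repeat_interleave(rows: list, n_group: int) -> list:
    """Repeat each row n_group times, keeping the copies of a row adjacent."""
    return [row for row in rows for _ in range(n_group)]


n_group = 5  # assumed group size (beam size or best-of-n)

# Batch of 1: tokens are repeated to 5 rows, audio features stay at 1 row.
# The size-1 batch dimension broadcasts, so the mismatch goes unnoticed.
hidden_by_broadcasting = broadcast_batch(1 * n_group, 1)

# Batch of 2: tokens have 10 rows, audio features have 2 rows.
try:
    broadcast_batch(2 * n_group, 2)
    mismatch_detected = False
except ValueError:
    mismatch_detected = True

# Repeating the audio features by the group size gives matching batch sizes.
audio_rows = repeat_interleave(["audio_0", "audio_1"], n_group)
fixed = broadcast_batch(2 * n_group, len(audio_rows))
```

Keeping the copies of each entry adjacent matters here: the i-th group of token rows must line up with the copies of the i-th audio entry, which is the same ordering an interleaved repeat along the batch dimension would produce.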