SpeechT5
Why can WavLLM understand audio sounds as well?
Hi, I tested WavLLM and found that it can sometimes understand audio sounds too. Since all the training data mentioned in the paper is speech-related, I wonder where this capability comes from?