The Speaking Includes Background Music
On the 1.5 model, there is background music with the generated speech. How can this be stopped?
did you figure it out? I use demucs, its opensource and from meta. I use it to split the vocals from background musinc
On the 1.5 model, there is background music with the generated speech. How can this be stopped?
This is an intended artifact from training to watermark the generated audio.
Some voices like the ones that are already in the demo folder of the community version are likely to generate it and it very often happens at the beggining/end meaning it can e cut out manually, otherwise, you'd need to separate it using demucs or the like.
The same goes for other random noises.