vall-e
vall-e copied to clipboard
Emilia dataset
have you seen this dataset? maybe it's better suited for zero-shot task, more natural speech than audiobook
https://github.com/open-mmlab/Amphion/blob/main/preprocessors/Emilia/README.md