davtoro

Results 1 comments of davtoro

@raikarsagar You are missing around 59.5k hours of training data. Read the paper, main thing of vall-e besides using audio codec is the amount of data they trained on which...