davtoro
@raikarsagar You are missing around 59.5k hours of training data. Read the paper: besides using an audio codec, the main thing about VALL-E is the amount of data they trained on, which...