LightningFastSpeech2

Results 4 LightningFastSpeech2 issues

This project has gotten quite complex, and good documentation is needed. I will go with MkDocs again.

Metrics (such as DTW distance, MAE, etc.) should (a) be modular, so that new ones are easy to add, and (b) include Wasserstein distance and others.
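One way to make the metrics modular is a small registry that new metrics join via a decorator. The following is only a sketch under assumptions: the names `METRICS`, `register_metric`, and `evaluate` are hypothetical and not taken from the project, and the metrics operate on plain 1-D sequences of floats rather than the project's actual tensors.

```python
import math
from typing import Callable, Dict, Iterable, Optional, Sequence

# Hypothetical registry: maps a metric name to a function (pred, target) -> float.
Metric = Callable[[Sequence[float], Sequence[float]], float]
METRICS: Dict[str, Metric] = {}


def register_metric(name: str) -> Callable[[Metric], Metric]:
    """Decorator that adds a metric function to the registry under `name`."""
    def decorator(fn: Metric) -> Metric:
        METRICS[name] = fn
        return fn
    return decorator


@register_metric("mae")
def mae(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean absolute error between two equal-length sequences."""
    if len(pred) != len(target):
        raise ValueError("mae requires sequences of equal length")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


@register_metric("wasserstein_1d")
def wasserstein_1d(pred: Sequence[float], target: Sequence[float]) -> float:
    """1-D Wasserstein-1 distance for two equal-size, uniformly weighted samples.

    In this special case it is the mean absolute difference of the sorted values.
    """
    if len(pred) != len(target):
        raise ValueError("this simplified version requires equal sample sizes")
    return sum(abs(p - t) for p, t in zip(sorted(pred), sorted(target))) / len(pred)


@register_metric("dtw")
def dtw(pred: Sequence[float], target: Sequence[float]) -> float:
    """Dynamic time warping distance with absolute difference as the local cost."""
    n, m = len(pred), len(target)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(pred[i - 1] - target[j - 1])
            cost[i][j] = local + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def evaluate(pred: Sequence[float], target: Sequence[float],
             names: Optional[Iterable[str]] = None) -> Dict[str, float]:
    """Run the selected metrics (all registered ones by default)."""
    selected = list(names) if names is not None else list(METRICS)
    return {name: METRICS[name](pred, target) for name in selected}
```

With this layout, adding a metric means writing one decorated function; nothing else has to change. For example, `evaluate(pred, target, names=["mae", "wasserstein_1d"])` returns a dictionary keyed by metric name.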

There should be parameter groups (with a certain prefix, e.g. "fastdiff_vocoder"), as well as config files. It might be worth looking into using an existing project for this or creating...
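As a rough illustration of prefixed parameter groups, the sketch below uses only the standard-library `argparse`. The helper names `add_prefixed_group` and `split_by_prefix`, and the parameters `lr` and `n_layers`, are hypothetical; only the prefix "fastdiff_vocoder" comes from the issue text. Config-file support is not covered here.

```python
import argparse
from typing import Any, Dict, Iterable


def add_prefixed_group(parser: argparse.ArgumentParser, prefix: str,
                       defaults: Dict[str, Any]) -> None:
    """Register one argument group whose options all share `prefix`.

    Assumption: defaults are int, float, or str, so type(default) can parse them.
    """
    group = parser.add_argument_group(prefix)
    for name, default in defaults.items():
        group.add_argument(f"--{prefix}_{name}", type=type(default), default=default)


def split_by_prefix(namespace: argparse.Namespace,
                    prefixes: Iterable[str]) -> Dict[str, Dict[str, Any]]:
    """Turn a flat namespace into one dict per prefix, with the prefix stripped."""
    # Longest prefixes first, so "a_b" wins over "a" for a key like "a_b_lr".
    ordered = sorted(prefixes, key=len, reverse=True)
    grouped: Dict[str, Dict[str, Any]] = {p: {} for p in ordered}
    for key, value in vars(namespace).items():
        for p in ordered:
            if key.startswith(p + "_"):
                grouped[p][key[len(p) + 1:]] = value
                break
    return grouped


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # Hypothetical parameters, for illustration only.
    add_prefixed_group(parser, "fastdiff_vocoder", {"lr": 2e-4, "n_layers": 4})
    return parser
```

Parsing `["--fastdiff_vocoder_lr", "0.001"]` with `build_parser()` and passing the result to `split_by_prefix(args, ["fastdiff_vocoder"])` yields one dictionary per group, which can then be handed to the corresponding component as keyword arguments.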

Currently, dataset caching (among other things) is handled in the training code, where it does not belong. It is worth considering separating the TTSDataset out into its own repository, or at least have it...
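To illustrate keeping caching out of the training code, here is a minimal sketch of a cache helper that could live next to the dataset instead. Everything in it is an assumption: `cached_build` is a hypothetical name, the cache key is derived from a JSON-serializable config, and `pickle` stands in for whatever storage format the project actually uses.

```python
import hashlib
import json
import pickle
from pathlib import Path
from typing import Any, Callable, Dict


def cached_build(cache_dir: str, config: Dict[str, Any],
                 build_fn: Callable[[Dict[str, Any]], Any]) -> Any:
    """Return the result of build_fn(config), reusing an on-disk copy if present.

    The cache key is a hash of the config, so changing any setting
    triggers a rebuild instead of silently reusing stale data.
    """
    payload = json.dumps(config, sort_keys=True).encode("utf-8")
    key = hashlib.sha256(payload).hexdigest()[:16]
    path = Path(cache_dir) / f"{key}.pkl"

    if path.exists():
        with path.open("rb") as f:
            return pickle.load(f)

    data = build_fn(config)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("wb") as f:
        pickle.dump(data, f)
    return data
```

The training script would then only request the dataset, and whether it came from the cache or was rebuilt stays an internal detail of the data layer.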