NeMo
NeMo copied to clipboard
change tensor interface to B,T,D between mel generator and vocoder
updates for TTS models to generate mels with output of type B, T, D and all vocoders to take as input B,T,D shapes to enable continues memory when grabbing chunks of spectrogram at inference time.
Please hold off merge of this until we make sure neural type information is included in nemo files.
Please hold off merge of this until we make sure neural type information is included in nemo files.
@ryanleary @junkin Should we try to merge this into NeMo? Or do you still want to block for now?
@blisc @junkin and @ryanleary what's the status of this? Do we need this PR? If not, someone, please close