NeMo icon indicating copy to clipboard operation
NeMo copied to clipboard

change tensor interface to B,T,D between mel generator and vocoder

Open junkin opened this issue 3 years ago • 3 comments

updates for TTS models to generate mels with output of type B, T, D and all vocoders to take as input B,T,D shapes to enable continues memory when grabbing chunks of spectrogram at inference time.

junkin avatar Jun 09 '21 00:06 junkin

Please hold off merge of this until we make sure neural type information is included in nemo files.

ryanleary avatar Jun 09 '21 01:06 ryanleary

Please hold off merge of this until we make sure neural type information is included in nemo files.

@ryanleary @junkin Should we try to merge this into NeMo? Or do you still want to block for now?

blisc avatar Jul 20 '21 19:07 blisc

@blisc @junkin and @ryanleary what's the status of this? Do we need this PR? If not, someone, please close

okuchaiev avatar Apr 05 '22 23:04 okuchaiev