Slava715
Results
2
comments of
Slava715
> @CoderHam I used the TensorRT backend. The model is NVIDIA's Conformer pre-trained model: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_en_conformer_ctc_small > > The .onnx and .plan files and the .pbtxt files can be found here:...
> @adrianastan Hope it works out for you. Just wanted to add that I got great results by concatenating the speaker embedding directly to the input of the 1) pitch...