Bharani

Results 9 comments of Bharani

Thanks for the response I tried changing the dimensions and using the model from the above link with the released model but got the following error `Error(s) in loading state_dict...

Thanks for the quick reply I actually tried replacing xvector part of the embedding (512) with 2 instances of d-vector from the above model (2×256) and the results improved a...

I actually didn't fine-tune the model, I simply replaced xvector embeddings with dvector embeddings in the following function [IMS-Toucan/Preprocessing/ProsodicConditionExtractor.py](https://github.com/DigitalPhonetics/IMS-Toucan/blob/2cd5d893639e8d4bfa9acffa09a519b37a908768/Preprocessing/ProsodicConditionExtractor.py#L11) Line 11 in [2cd5d89](https://github.com/DigitalPhonetics/IMS-Toucan/commit/2cd5d893639e8d4bfa9acffa09a519b37a908768) class ProsodicConditionExtractor: and tried with unseen languages...

I will try finetuning on the dvector embeddings too then and see the results. I will continue training the Aligner and check posteriograms instead of purely depending on loss By...

Thanks for the tip I will try training a model from scratch, I tried finetuning on d-vector embeddings and it solved most of my gender conversion issue but as you...

Thanks for the quick reply I actually tried replacing xvector part of the embedding (512) with 2 instances of d-vector from the above model (2×256) and the results improved a...

when i am trying to do it as you said using full api it says cannot convert from string to float it is expecting input to be numpy data but...

sorry for troubling you again I tried using np.fromstring but it is not working It doesn't detect anything. Can you please make a program to record audio using pyaudio in...

Thank you very much for the code i will try it and will tell you if it works for me. Thanks again.