
Question regarding switching speakers, weights during runtime.

Open michaellin99999 opened this issue 1 year ago • 0 comments

Hi, I have fine-tuned two versions of an ESPnet model myself: one with a man's voice and one with a woman's. Both share the same VITS-based base model.

Is there a way to hot-swap weights at runtime?
The experience I am looking for is that I only need to host one model and can simulate a man and a woman talking to each other just by hot-swapping the weights during inference, preferably with less than 2 seconds of latency. I am currently running on a 4090 GPU.
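
To make the idea concrete, here is a rough sketch of the swap I have in mind. It assumes both sets of fine-tuned weights are kept in memory and loaded into the single hosted model whenever the speaker changes. `TinyModel`, `SpeakerSwapper`, `synthesize`, and the weight values are hypothetical stand-ins for illustration, not ESPnet API. The only real interface assumed is `load_state_dict`, which `torch.nn.Module` also provides with the same call shape.

```python
class TinyModel:
    """Hypothetical stand-in for the shared VITS model (not ESPnet API)."""

    def __init__(self):
        self.params = {"w": 0.0}

    def load_state_dict(self, state):
        # Same call shape as torch.nn.Module.load_state_dict.
        self.params = dict(state)

    def synthesize(self, text):
        # Placeholder for inference; tags the output with the active weights.
        return f"{text}|w={self.params['w']}"


class SpeakerSwapper:
    """Hosts one model and swaps cached weights when the speaker changes."""

    def __init__(self, model, checkpoints):
        self.model = model
        self.checkpoints = checkpoints  # speaker name -> state dict held in memory
        self.active = None

    def speak(self, speaker, text):
        # Reload weights only when the speaker differs from the last call.
        if speaker != self.active:
            self.model.load_state_dict(self.checkpoints[speaker])
            self.active = speaker
        return self.model.synthesize(text)


# Hypothetical weights; in practice these would be the two fine-tuned checkpoints.
swapper = SpeakerSwapper(TinyModel(), {"man": {"w": 1.0}, "woman": {"w": 2.0}})
dialogue = [
    swapper.speak("man", "Hello"),
    swapper.speak("woman", "Hi"),
    swapper.speak("man", "Bye"),
]
```

Keeping both state dicts resident avoids reading a checkpoint from disk on every turn. I have not measured whether this stays under 2 seconds with the real model.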

michaellin99999, Mar 26 '24 01:03