Pascal Fischer
Pascal Fischer
When do you expect to release the Image encoder ?
When transcribing a 3min audio with basic parameters and no stem, the resulting .srt file only consists of a part from the original audio sometimes its the start, sometimes the...
hello, I have two question about the Linear Attention that was added later. Can you clear me up why it is called Linear Attention when the referenced paper introduces Cross-Covariance...
From the paper it seems that the architecture of syncnet is exactly the same as for wav2lip. Is that true or is it a modified version with a new Image...