LSTM_PIT_Speech_Separation
LSTM_PIT_Speech_Separation copied to clipboard
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
hello, I have problems when I try to train the model, could you tell me the exact meanings of the following variables:  and what do they mean? thanks! I...
How can I use the VCTK-dataset to train the model? Should I alter the structure of VCTK-dataset downloading from the origin webpage? Thanks for your reply.
I am a newbie. Since I have no wsj0 datasets, I have to create mixtures on my own datasets. In the mix_2_spk_tr/cv/tt.txt, each wav is assigned with a weight, just...
I've trained the model using VCTK-dataset and decoded it.How to use the trained model to separate a mixed audio file ?. Do we need to write a new code or...
hi i have wv version of wsj0, how can i change this format to the wav without any information loss in the files? you know wv is a specific compressed...
uPIT
你好。我对uPIT有一个盲点,一直搞不清楚,想向你请教一下。uPIT是针对整条语句的,那它是把一条语音作为一个样本送进网络进行训练的吗? 还是我的理解是错的呢?谢谢你哈。
Can you please explain procedure or different steps to pre-process data before feed to LSTM. I am working on paper by Zhuo Chen on **"Speaker-Independent Speech Separation With Deep Attractor...
Hi can you explain what should be the actual structure of the dataset? not sure i got it.. is it like in the matlab file you attached? i.e. cv/tr/tt folder,...
There are two ways to create the mixtures, version 1 and version 2. Are there any differences? which version should I follow?
Hello, big brother, can this model be used for Chinese segmentation? Do we need to change the data set?thanks!