LSTM_PIT_Speech_Separation icon indicating copy to clipboard operation
LSTM_PIT_Speech_Separation copied to clipboard

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

Results 17 LSTM_PIT_Speech_Separation issues
Sort by recently updated
recently updated
newest added

hello, I have problems when I try to train the model, could you tell me the exact meanings of the following variables: ![image](https://user-images.githubusercontent.com/49680845/151945969-71b9377c-db3b-4dae-a8b4-2b27f7cb8f5a.png) and what do they mean? thanks! I...

How can I use the VCTK-dataset to train the model? Should I alter the structure of VCTK-dataset downloading from the origin webpage? Thanks for your reply.

I am a newbie. Since I have no wsj0 datasets, I have to create mixtures on my own datasets. In the mix_2_spk_tr/cv/tt.txt, each wav is assigned with a weight, just...

I've trained the model using VCTK-dataset and decoded it.How to use the trained model to separate a mixed audio file ?. Do we need to write a new code or...

hi i have wv version of wsj0, how can i change this format to the wav without any information loss in the files? you know wv is a specific compressed...

你好。我对uPIT有一个盲点,一直搞不清楚,想向你请教一下。uPIT是针对整条语句的,那它是把一条语音作为一个样本送进网络进行训练的吗? 还是我的理解是错的呢?谢谢你哈。

Can you please explain procedure or different steps to pre-process data before feed to LSTM. I am working on paper by Zhuo Chen on **"Speaker-Independent Speech Separation With Deep Attractor...

Hi can you explain what should be the actual structure of the dataset? not sure i got it.. is it like in the matlab file you attached? i.e. cv/tr/tt folder,...

There are two ways to create the mixtures, version 1 and version 2. Are there any differences? which version should I follow?

Hello, big brother, can this model be used for Chinese segmentation? Do we need to change the data set?thanks!