Dual-Path-RNN-Pytorch icon indicating copy to clipboard operation
Dual-Path-RNN-Pytorch copied to clipboard

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

Results 36 Dual-Path-RNN-Pytorch issues
Sort by recently updated
recently updated
newest added

学长好,我在运行您的DPRNN的时候,运行python train_rnn.py --opt config/Dual_RNN/train_rnn.yml时,不知道为什么一直报下面的错,不知道为什么: 22-06-09 16:31:20 [train_rnn.py:69 - INFO ] Building the model of Dual-Path-RNN 22-06-09 16:31:20 [train_rnn.py:72 - INFO ] Building the optimizer of Dual-Path-RNN 22-06-09 16:31:20 [train_rnn.py:76 -...

请问我可以使用cpu训练吗 在train.yml里面应该怎么修改

C:\Users\yhq\.conda\envs\pytorch\python.exe D:/Dual-Path-RNN-Pytorch-master/Dual-Path-RNN-Pytorch-master/train_rnn.py C:\Users\yhq\.conda\envs\pytorch\lib\site-packages\torchaudio\extension\extension.py:14: UserWarning: torchaudio C++ extension is not available. warnings.warn('torchaudio C++ extension is not available.') Traceback (most recent call last): File "D:/Dual-Path-RNN-Pytorch-master/Dual-Path-RNN-Pytorch-master/train_rnn.py", line 91, in train() File "D:/Dual-Path-RNN-Pytorch-master/Dual-Path-RNN-Pytorch-master/train_rnn.py", line...

您好,我看到您的知乎分享贴中说DPRNN好像一个batch size效果最好, 您试过batch size等于2或者更高的时候吗,您在代码的readme中说100个epoch之后DPRNN的sisnr能到达18.98dB,那batch size = 2的时候sisnr能到达多少您记得吗?因为batchsize=1训练起来非常慢,我想试试batch size=2的时候,但是不知道能获得什么效果。

学长,8000hz训练的,输出应该是8000吧,宁写出的时候那里写成16khz了

学长,因为您这套代码是用8khz做训练的,我能否用您的pretrain-model在16khz数据集上做微调呢?

想知道Conv-TasNet为什么不保留原论文中的skip-connection部分呢?

I try to train the model with 3-mix datasets, and I just change the num_speakers and data path, is there anything else I need to modify? thanks! the following is...

这个是什么意思,怎么修改呢?求教 ![image](https://user-images.githubusercontent.com/49680845/154786971-a9a13460-8db3-467d-8603-accbc37f07b7.png)

训练生成的模型用于分离测试集中的混合音频,为何得到的分离音频有较大噪声呢? stoi指标很低,有什么方法改进吗? 求助,万分感谢!