AI Enthusiast comments

Results 45 comments of


                                            AI Enthusiast

Keras

@ujscjj Sorry, I haven't tried keras. You know, Tensorflow can be backend in Keras but Tensorflow must be used when many underlying operations are involved.

what is the difference between version 1 and version 2?

both versions are okay

Dataset structure

The actual structure of the dataset is as what you say. "run.sh" is in my programs but not uploaded. This file is including all processes.

Dataset structure

@shreyasose Hi, I'm sorry for sharing the dataset because the dataset WSJ0 needs paid license.

How to set the weight of each wav in the mix_2_spk_tr.txt, mix_2_spk_cv.txt...

@soliloquy1983 Yeah, randomly.

uPIT

@ujscjj 是的。之所以这么做是因为如果按照帧进行训练输出，在测试时因为没有先验信息就不知道一个T-F bin属于哪一个说话人。与此同时，实验证明句子级别的置换不变训练)的方法相对于多类回归的模型也更加容易收敛，在没有见过的说话人甚至语言上也有很好的泛化性能。

uPIT

@Yangjie55 最近我正在看并实现DANet。TIMIT数据集可以用这个代码的，简单点的话就是把TIMIT文件的组织形式改成WSJ0的文件组织形式，不然就修改代码

uPIT

@nagasaibharath Hi, you just need to save the mixed audio to folders in the same format as WSJ0's.

Model suitability

This model is speaker-independent and language-independent. @CLCLpis

PESQ Problem

@ujscjj The code has been sent to you.