Morank88

Results 6 comments of Morank88

I manage to get some progress. Now I training on data from LibriSpeech train-clean-100 and train-clean-360 and testing on train-dev-clean. After 40k steps the SDR reached only to ~5. Is...

Here are tensorboard results: ![image](https://user-images.githubusercontent.com/67504223/86105626-3f830c80-bac8-11ea-8e85-0703e0924913.png)

Great, thx. So here is where my script failed: convtasnet_audio_with_asr_model = DataParallel(ConvTasNet(C=2, test_with_asr=True)).cuda() convtasnet_audio_without_asr_model = DataParallel(ConvTasNet(C=2, asr_addition=False)).cuda() convtasnet_audio_without_asr_model.load_state_dict(torch.load(convtasnet_model)['model_state_dict']) convtasnet_audio_with_asr_model.load_state_dict(torch.load(convtasnet_asr_model)['model_state_dict']) The convtasnet_model is AudioOnlyConvTasNet.pth and the convtasnet_asr_model is ASR.pth, both downloaded...

Hi, It solves the issue of loading the checkpoint, but after running the test on my own mixture wav file I got bad results, basically noise... Here is my script:...

Maybe my pre-processing is different?

Hi @pragyak412, Ok, I understand. What should be the dimension conversion of the asr features for arbitrary audio length? BTW, it seems that my pre-processing follows what is required. Regarding...