SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
I'm using your model to train on TIMIT, and my parameters are the same as yours, such as the learning rate and batch size. I use the script 'speaker_id.py' to train the model, and 'compute_d_vector.py'...

Hi Mirco, can you please upload a cfg file or the data preprocessing method for the VoxCeleb dataset? Using the cfg file of LibriSpeech or TIMIT yields a very high EER.
Hi and thank you for sharing the code! I was studying the creation of the Sinc filterbank in the SincConv_fast class and I have a question about this section: ```...
Hello Mirco Ravanelli, training SincNet for speaker-id on TIMIT data following your directions, with the config file provided in your GitHub repo, ends up with a different cumulative frequency response. The...
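For anyone trying to reproduce that comparison, a cumulative frequency response can be obtained by summing the magnitude responses of the learned band-pass filters. The sketch below is my own reconstruction, not code from the repo; it assumes the learned low/high cutoff frequencies (in Hz) have already been extracted from the trained model:

```python
import numpy as np

def cumulative_response(low_hz, high_hz, fs=16000, kernel_size=251, n_fft=2048):
    """Sum of magnitude responses of difference-of-sinc band-pass filters
    defined by learned cutoffs low_hz/high_hz (1-D arrays, in Hz)."""
    n = np.arange(-(kernel_size // 2), kernel_size // 2 + 1)
    cum = np.zeros(n_fft // 2 + 1)
    for f1, f2 in zip(low_hz, high_hz):
        # band-pass filter = difference of two low-pass sinc filters
        h = 2 * f2 / fs * np.sinc(2 * f2 * n / fs) - 2 * f1 / fs * np.sinc(2 * f1 * n / fs)
        cum += np.abs(np.fft.rfft(h, n_fft))
    return cum  # plot against np.linspace(0, fs / 2, n_fft // 2 + 1)
```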
numpy.save has allow_pickle=True as its default while numpy.load has allow_pickle=False, so this raises an error that can be fixed by explicitly setting the allow_pickle argument.
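A minimal sketch of the workaround; the label-dictionary filename here is just an example and may differ from the one used in the repo:

```python
import numpy as np

# Saving a Python dict pickles it (np.save defaults to allow_pickle=True).
labels = {"spk01_utt1.wav": 0, "spk01_utt2.wav": 0}  # hypothetical label dictionary
np.save("TIMIT_labels.npy", labels)

# np.load defaults to allow_pickle=False (since NumPy 1.16.3), so loading the same
# file raises "Object arrays cannot be loaded when allow_pickle=False"
# unless the flag is set explicitly:
lab_dict = np.load("TIMIT_labels.npy", allow_pickle=True).item()
print(lab_dict["spk01_utt1.wav"])
```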
The current LibriSpeech configuration throws an error due to a missing third value for fc_drop. We can either reduce the number of fully connected layers (fc_lay) or increase the...
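As a quick sanity check, the per-layer lists in the cfg file can be validated before training. This is not part of the repo; the section and option names below are assumptions based on the TIMIT cfg, and the path is hypothetical:

```python
import configparser

def check_fc_options(cfg_path):
    """Check that the comma-separated per-layer lists in the [dnn] section
    have the same number of entries (one per fully connected layer)."""
    cfg = configparser.ConfigParser()
    cfg.read(cfg_path)
    dnn = cfg["dnn"]  # section/option names assumed from the TIMIT cfg
    fc_lay = dnn["fc_lay"].split(",")
    fc_drop = dnn["fc_drop"].split(",")
    if len(fc_lay) != len(fc_drop):
        raise ValueError(
            f"fc_lay has {len(fc_lay)} entries but fc_drop has {len(fc_drop)}; "
            "either shorten fc_lay or add the missing fc_drop value(s)."
        )

check_fc_options("cfg/SincNet_Librispeech.cfg")  # hypothetical path
```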
I ran the code and got NaN from the loss function after 2-3 iterations. While investigating the problem I saw that some parameters such as gamma...
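One way to localize such a failure is to stop at the first NaN loss, report which gradients have blown up, and clip the gradient norm before the update. This is a generic debugging sketch of my own, not code from the repo:

```python
import torch

def debug_step(model, loss, optimizer, max_norm=5.0):
    """Generic training-step helper: abort on NaN loss, report non-finite
    gradients, and clip the gradient norm before the optimizer update."""
    if torch.isnan(loss):
        raise RuntimeError("Loss became NaN; inspect the learning rate and the "
                           "normalization parameters (e.g. gamma) of the previous step.")
    optimizer.zero_grad()
    loss.backward()
    for name, p in model.named_parameters():
        if p.grad is not None and not torch.isfinite(p.grad).all():
            print(f"non-finite gradient in {name}")
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)
    optimizer.step()
```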
I tested the performance of your pretrained model on the well-known VoxCeleb1 dataset (http://www.robots.ox.ac.uk/~vgg/data/voxceleb/). The EER I got is 30%. I used compute_d_vector.py to get the d-vector of the audio...
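For context, speaker-verification EER on VoxCeleb1 is typically computed from cosine similarities between d-vectors over the official trial list. The following is a minimal sketch of my own (not the repo's scoring code), assuming the enrolment/test d-vectors and 0/1 target labels for each trial are already available:

```python
import numpy as np
from sklearn.metrics import roc_curve

def cosine_scores(enrol, test):
    """Cosine similarity between paired d-vectors (one row per trial)."""
    enrol = enrol / np.linalg.norm(enrol, axis=1, keepdims=True)
    test = test / np.linalg.norm(test, axis=1, keepdims=True)
    return np.sum(enrol * test, axis=1)

def equal_error_rate(labels, scores):
    """EER: the operating point where false-acceptance and false-rejection rates cross."""
    fpr, tpr, _ = roc_curve(labels, scores)
    fnr = 1.0 - tpr
    idx = np.nanargmin(np.abs(fnr - fpr))
    return (fpr[idx] + fnr[idx]) / 2.0

# hypothetical usage: d-vectors of shape (n_trials, dim), labels 1 = same speaker
# eer = equal_error_rate(labels, cosine_scores(enrol_dvecs, test_dvecs))
```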
https://github.com/mravanelli/SincNet/blob/master/dnn_models.py#L106-L108 :

```python
#self.window_ = torch.hamming_window(self.kernel_size)
n_lin = torch.linspace(0, (self.kernel_size / 2) - 1, steps=int((self.kernel_size / 2)))  # computing only half of the window
self.window_ = 0.54 - 0.46 * torch.cos(2 * math.pi * n_lin / self.kernel_size)
```

Could it be replaced instead by `self.window_ = torch.hamming_window(kernel_size)[:kernel_size // 2]`? Or...
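One way to judge the proposed replacement is to compare the two half-windows numerically. A quick standalone sketch (mine, not from the repo); note that torch.hamming_window defaults to periodic=True and SincNet uses an odd kernel_size, so the two may differ slightly rather than match exactly:

```python
import math
import torch

kernel_size = 251  # SincNet uses an odd kernel size

# manual half-window, as written in dnn_models.py
n_lin = torch.linspace(0, (kernel_size / 2) - 1, steps=int(kernel_size / 2))
manual = 0.54 - 0.46 * torch.cos(2 * math.pi * n_lin / kernel_size)

# proposed replacement: built-in Hamming window, truncated to the first half
builtin = torch.hamming_window(kernel_size)[: kernel_size // 2]

print(torch.max(torch.abs(manual - builtin)))  # maximum deviation between the two
```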