3D-convolutional-speaker-recognition-pytorch
3D-convolutional-speaker-recognition-pytorch copied to clipboard
POI utterance duration
I recently requested the voxceleb1 data and got access to it. But I couldn't find the POI start and end time values in any of the .txt files provided in the dataset. Any idea whether the dataset format changed and or I am missing anything here to get my research forward?
I have the same problem. Have you solved it?