caffe-speech-recognition
caffe-speech-recognition copied to clipboard
Could you please give an idea on how to generate our own dataset with our voices. In the dataset, what does the number at the end represent. Eg: in the image 0_Karen_160.png , what is 160?