Spoken-language-identification

How to run this code

Open YichiHuang opened this issue 8 years ago • 4 comments

Hi, I have a speech dataset in which each recording is 2 to 5 seconds long. How can I run your code on my dataset? Can you give me some advice? Thanks.

YichiHuang, Nov 29 '16 11:11

Hi,

First, create spectrograms of the recordings (you can use create_spectrograms.py for that), then make training and validation list files like this, and finally run theano/main.py. Since the recordings in your dataset vary in length, you should either set batch_size=1 or equalize the lengths within each mini-batch.
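One way to equalize the lengths within a mini-batch is to zero-pad every spectrogram to the width of the longest one in that batch. The snippet below is only a minimal sketch of that idea using NumPy. `pad_batch` is a hypothetical helper, not part of this repository, and it assumes each spectrogram is a 2-D array shaped (frequency bins, time frames). Whether the networks in theano/main.py behave well on padded input is not something this thread confirms.

```python
import numpy as np


def pad_batch(spectrograms, pad_value=0.0):
    """Right-pad 2-D spectrograms to a common width (hypothetical helper).

    Assumes each item is shaped (freq_bins, time_frames) and that all items
    share the same number of frequency bins.
    Returns the padded batch and the original width of each item.
    """
    n_bins = spectrograms[0].shape[0]
    max_frames = max(s.shape[1] for s in spectrograms)

    # Start from an array filled with the padding value, then copy each
    # spectrogram into the left part of its slot.
    batch = np.full((len(spectrograms), n_bins, max_frames), pad_value,
                    dtype=np.float32)
    lengths = np.zeros(len(spectrograms), dtype=np.int64)

    for i, spec in enumerate(spectrograms):
        if spec.shape[0] != n_bins:
            raise ValueError("all spectrograms must have the same number of bins")
        batch[i, :, :spec.shape[1]] = spec
        lengths[i] = spec.shape[1]

    return batch, lengths


if __name__ == "__main__":
    # Two dummy spectrograms of different widths (illustrative sizes only).
    short = np.ones((256, 172), dtype=np.float32)
    long_ = np.ones((256, 429), dtype=np.float32)
    batch, lengths = pad_batch([short, long_])
    print(batch.shape)  # (2, 256, 429)
```

Keeping the original widths in `lengths` makes it possible to mask or ignore the padded frames later. Cropping every recording to the shortest width in the batch is an alternative that avoids padding but discards audio.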

hrayrhar, Nov 30 '16 16:11

Hi, I followed your experimental procedure and ran it on my dataset. My dataset contains 50 classes, with 400 recordings of 5 seconds each per category. First, I created a spectrogram of each recording, with size 256x429. Then I split them into a training set (1500 in total) and a validation set (500 in total). Finally, I ran main.py with the default parameters except: --network==tc_net_mod

I got this result:

accuracy: 0.59 percent
accuracy: 2.08 percent
saving ... states/tc_net_mod.b32.bn.epoch499.test3.91200.state

The accuracy is extremely low, and I wonder whether there is something wrong with my experiment. Can you give some comments? Thanks.

YichiHuang, Dec 05 '16 00:12

[Screenshot from 2017-02-26 09-06-00]

I am getting an error like this.

MurugeshMarvel, Feb 26 '17 03:02

@YichiHuang can you please share your code? I am trying to learn too.

manbharae, Jan 18 '18 16:01