Juntae, Kim comments

Results: 59 comments by Juntae, Kim

data and model question

I'm really sorry for the late answer; I have been very busy these days :( . For the above questions: maybe your test speech is from Aurora, which contains short utterances and...

data and model question

1. I cannot be sure that my normalization factor is perfect for every situation. However, if we use a large dataset that can represent the population mean and variance, the normalization...
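To illustrate the idea of dataset-level normalization mentioned here, the following is a minimal sketch, not the repository's actual code. It assumes features are stored as lists of frames (one list of floats per frame); the function names `fit_global_stats` and `normalize` and the `eps` value are hypothetical.

```python
import math


def fit_global_stats(utterances):
    """Estimate per-dimension mean and variance over every frame of every utterance."""
    count = 0
    sums = None
    sq_sums = None
    for frames in utterances:
        for frame in frames:
            if sums is None:
                sums = [0.0] * len(frame)
                sq_sums = [0.0] * len(frame)
            for d, x in enumerate(frame):
                sums[d] += x
                sq_sums[d] += x * x
            count += 1
    mean = [s / count for s in sums]
    # Population variance E[x^2] - E[x]^2, clipped at zero against rounding error.
    var = [max(sq / count - m * m, 0.0) for sq, m in zip(sq_sums, mean)]
    return mean, var


def normalize(frames, mean, var, eps=1e-8):
    """Apply the same global statistics to any utterance (eps is an assumed guard)."""
    return [
        [(x - m) / math.sqrt(v + eps) for x, m, v in zip(frame, mean, var)]
        for frame in frames
    ]


# Toy data: two utterances with 2-dimensional features.
utterances = [[[1.0, 10.0], [3.0, 10.0]], [[5.0, 10.0]]]
mean, var = fit_global_stats(utterances)
normalized = normalize(utterances[0], mean, var)
```

The statistics are computed once over the whole training set and reused at test time, which is why the size and representativeness of that set matters.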

data and model question

No, the threshold is used only in bdnn and ACAM. Please refer to the model definitions of DNN and LSTM: their prediction is made by an argmax function across the softmax dimension.
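The difference between the two decision rules can be sketched as follows. This is an illustration under assumptions, not the repository's code: it assumes a two-class softmax output per frame with index 1 meaning speech, and the function names and the 0.5 default threshold are hypothetical.

```python
def predict_argmax(softmax_rows):
    """Label each frame with the index of its largest softmax output (no threshold)."""
    return [max(range(len(row)), key=row.__getitem__) for row in softmax_rows]


def predict_threshold(speech_probs, threshold=0.5):
    """Label a frame as speech (1) when its speech score reaches the threshold."""
    return [1 if p >= threshold else 0 for p in speech_probs]


# Two frames of assumed [non-speech, speech] softmax outputs.
argmax_labels = predict_argmax([[0.9, 0.1], [0.3, 0.7]])
# Three frames of assumed speech scores with a lowered threshold.
threshold_labels = predict_threshold([0.2, 0.6, 0.45], threshold=0.4)
```

With argmax there is nothing to tune, whereas the threshold rule lets you trade missed speech against false alarms by moving the cutoff.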

ACAM always detect badly on the start of a corpus

Hi, is there any silence at the start of your sample? If not, the result may not be good. Because ACAM is a context-based model, there should be some samples to...

Exception while executing vad_test.py.

I'm very sorry for the late answer.
1. audio_dir = './data/example/clean_speech.wav' # directory for specific audio file
2. The length of your audio data, i.e. length(audio) in MATLAB notation.
3. Unfortunately...

Questions understanding bdnn_transform function.

Excellent! Thank you for your interest and contributions! Because it has been a long time since I implemented it, I can't remember it in exact detail. However, the purpose is,...

After how many iterations did your proposed VAD model converge ?

This question is quite difficult to answer, because the necessary training time depends on the training data. Therefore, you should inspect your training and validation curves to find an appropriate...
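As one possible way to inspect a validation curve programmatically, here is a small sketch. It is an assumption about what a reasonable check looks like, not the author's stated criterion; the helper names, the `patience` and `min_delta` parameters, and the loss values are all hypothetical.

```python
def best_checkpoint_step(val_losses, steps):
    """Return the training step at which the validation loss was lowest."""
    best_idx = min(range(len(val_losses)), key=val_losses.__getitem__)
    return steps[best_idx]


def has_plateaued(val_losses, patience=3, min_delta=0.0):
    """True if the last `patience` evaluations did not beat the earlier best by min_delta."""
    if len(val_losses) <= patience:
        return False
    best_before = min(val_losses[:-patience])
    return min(val_losses[-patience:]) >= best_before - min_delta


# Made-up validation losses recorded every 100 steps.
val_losses = [0.9, 0.7, 0.6, 0.61, 0.62, 0.63]
steps = [100, 200, 300, 400, 500, 600]
best_step = best_checkpoint_step(val_losses, steps)
plateaued = has_plateaued(val_losses, patience=3)
```

If the training loss keeps falling while the validation loss has plateaued or risen, further iterations are likely overfitting rather than converging.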

After how many iterations did your proposed VAD model converge ?

1. Approximately 2 epochs over the whole dataset I used.
2. No, that checkpoint is not the optimized one. For now, I only provide the optimized .pb file. If you...

Questions about the data normalization

1. If each sample file has very different noise characteristics and high noise energy, the mean and variance can depend on the noise signal rather than the speech signal. However, the...

Can Octave be used?

Rather than supporting Octave, we plan to write our code using only Python. Please wait for it. Thank you!