devicehive-audio-analysis icon indicating copy to clipboard operation
devicehive-audio-analysis copied to clipboard

TF record files for training custom new model

Open sleebapaul opened this issue 6 years ago • 0 comments

Hi @igor-panteleev, great job. I want to train my own model for two or three particular classes. I'm planning the following training pipeline.

128 dim embedding --> a classifier --> classes

Though I've seen the Google Audio set data is provided in 128 dim tf.records, I couldn't find it in a downloadable form in the site. I found a frame by frame tensorflow.SequenceExample file of 2.4 GB. But is that the same data you've used? Please help me on this.

On evaluation, the pipeline will be

WAV format --> VGGish --> 128 dim embedding --> a classifier --> prediction label

Do I miss something?

sleebapaul avatar Aug 03 '18 10:08 sleebapaul