devicehive-audio-analysis
devicehive-audio-analysis copied to clipboard
TF record files for training custom new model
Hi @igor-panteleev, great job. I want to train my own model for two or three particular classes. I'm planning the following training pipeline.
128 dim embedding --> a classifier --> classes
Though I've seen the Google Audio set data is provided in 128 dim tf.records, I couldn't find it in a downloadable form in the site. I found a frame by frame tensorflow.SequenceExample
file of 2.4 GB. But is that the same data you've used? Please help me on this.
On evaluation, the pipeline will be
WAV format --> VGGish --> 128 dim embedding --> a classifier --> prediction label
Do I miss something?