VGGish icon indicating copy to clipboard operation
VGGish copied to clipboard

Could you please provide the details of the pre-trained models?

Open silhouettesia opened this issue 7 years ago • 5 comments

Did you train the models on the AudioSet for AED? How did you get the raw audio dataset? Could you explain how to train the model without the fc layers? Thanks!

silhouettesia avatar Jan 01 '18 12:01 silhouettesia

or you just turn the released tensorflow trained model into keras ?

silhouettesia avatar Jan 08 '18 08:01 silhouettesia

Yes, I just simply turn the released TF trained model into Keras, without any training on the AudioSet.

DTaoo avatar May 27 '18 14:05 DTaoo

@DTaoo Is the repo for train the Vggish model ? Or is it to use the pretrained VGGish model?

xav12358 avatar Oct 30 '18 15:10 xav12358

I'm late for the party... but will answer it anyway, hope it's useful for someone else.

@xav12358 It is for both... you can initialize the model with random weights and retrain them from scratch, or just use the given weights, or even retrain from the given weights (aka fine tuning). This repo provides a Keras version of the VGGish model, and let you load it with the given weigths, then you can do whatever you want with it. The code for train/fine tune the model is not given but is not that difficult.

I'll try to upload one example later

@DTaoo shouldn't this issue be clossed?

rola93 avatar Jul 06 '19 11:07 rola93

It might be nice if the README.md had a link to where to find vggish_audioset_weights.h5 and vggish_audioset_weights_without_fc2.h5. The current link to TensorFlow's version points to a tensorflow checkpoint file for the weights, which can't be directly loaded into keras.

Jacob-Stevens-Haas avatar Feb 05 '20 21:02 Jacob-Stevens-Haas