AVE-ECCV18
How to extract audio features with VGGish, and what can be done with them?
Hello authors, I appreciate your wonderful contribution, but I have a few questions about how to extract the audio features. You said you obtain the audio features via VGGish; could you explain the processing steps? Once we have the features, how do we perform localization with them?