whisper-at icon indicating copy to clipboard operation
whisper-at copied to clipboard

Support for whisper-large-v3

Open spaghettiSystems opened this issue 1 year ago • 1 comments

Hello,

First of all, nice work!

Is it possible to release a checkpoint trained with whisper-large-v3? The reason I'm interested in this is that large-v3 is trained on a new dataset with 5 million hours of audio. I'm interested to see how that scaling will impact whisper-at.

Thank you.

spaghettiSystems avatar Feb 16 '24 05:02 spaghettiSystems

thanks for the suggestion, it is a very nice one.

I am currently busy on something else, so cannot do it immediately. What I can tell is:

1/ we provided code to train the model, extracting features from Whisper and do linear probing on ESC-50 should be relatively easy, and fast.

2/ compare v1 and v2, v1 has better general (non-speech) audio classification.

-Yuan

YuanGongND avatar Feb 16 '24 21:02 YuanGongND