w2v2-how-to

Korean depression/normal audio dataset

Open alexxony opened this issue 1 year ago • 3 comments

I have Korean audio datasets labeled as depression and normal.

Each recording is at least 2 minutes long.

Can I apply this model?

alexxony, Sep 19 '23 05:09

As a start, I would suggest extracting embeddings with the model and using them as features to train a classifier, e.g. an SVM. This should give you an idea of whether the model is applicable to your problem. As a next step, you could try fine-tuning the model on your data.
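A minimal sketch of the classifier step, assuming scikit-learn and one fixed-size embedding vector per recording. The helper name `evaluate_embeddings`, the label encoding, and the SVM settings are illustrative choices rather than anything prescribed by the model. The comment on how to obtain the embeddings follows the usage shown in the repository README and should be checked against the version you have installed.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: each recording has already been turned into one embedding,
# e.g. with something like
#     model = audonnx.load(model_root)
#     embedding = model(signal, sampling_rate)["hidden_states"]
# where model_root is a placeholder for wherever the model was unpacked.


def evaluate_embeddings(features, labels, folds=5):
    """Cross-validate an SVM on embedding vectors.

    features: array-like of shape (n_recordings, embedding_dim)
    labels:   array-like of shape (n_recordings,), e.g. 0 = normal, 1 = depression
    Returns the mean balanced accuracy over the folds.
    """
    features = np.asarray(features, dtype=np.float32)
    labels = np.asarray(labels)
    # Standardize the features, then fit an RBF-kernel SVM.
    # class_weight="balanced" compensates for unequal class sizes.
    classifier = make_pipeline(
        StandardScaler(),
        SVC(kernel="rbf", class_weight="balanced"),
    )
    scores = cross_val_score(
        classifier, features, labels, cv=folds, scoring="balanced_accuracy"
    )
    return float(scores.mean())
```

A balanced accuracy clearly above 0.5 on held-out folds would suggest the embeddings carry useful information for the depression/normal distinction. If several recordings come from the same speaker, a speaker-wise split is the safer choice, since a random split can overstate performance.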

frankenjoe, Sep 19 '23 07:09

And how can I use the GPU? It took too long.

alexxony, Sep 19 '23 23:09

https://audeering.github.io/audonnx/usage.html#run-on-the-gpu
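In short, the linked page describes selecting the device when the model is loaded. The sketch below assumes `onnxruntime-gpu` is installed in a version matching the local CUDA setup, and that `audonnx.load` accepts a `device` argument as described there. `model_root`, `pick_device`, and `load_model` are hypothetical names used only for illustration.

```python
def pick_device(available_providers):
    """Return 'cuda:0' if the CUDA execution provider is listed, else 'cpu'."""
    if "CUDAExecutionProvider" in available_providers:
        return "cuda:0"
    return "cpu"


def load_model(model_root):
    """Load the ONNX model from model_root, on the GPU when one is usable."""
    # Imported here so pick_device() can be used without these packages.
    import audonnx
    import onnxruntime

    # Ask ONNX Runtime which execution providers this installation supports.
    device = pick_device(onnxruntime.get_available_providers())
    # Assumption: the device string is passed through to the ONNX session.
    return audonnx.load(model_root, device=device)
```

If `pick_device` falls back to `'cpu'` on a machine with a GPU, the usual cause is that the CPU-only `onnxruntime` package is installed instead of `onnxruntime-gpu`.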

frankenjoe, Sep 20 '23 06:09