transformers.js icon indicating copy to clipboard operation
transformers.js copied to clipboard

Is it possible to use transformers.js to implement audio source separation tasks?

Open asasas234 opened this issue 1 year ago β€’ 2 comments

Question

Hello, I have a beginner's question.

I want to implement the task of removing the human voice from the audio in the video and retaining the background sound in the browser. The idea is to load the model for audio source separation related to transformers.js to achieve the separation of the background sound and human voice, and then only return the background sound.

But I couldn't find relevant examples in the documentation, so I was wondering if this can be implemented? If so, what are the learning or research paths?

Looking forward to your reply

asasas234 avatar Jun 02 '24 04:06 asasas234

Hi there πŸ‘‹ This library serves as a JavaScript port of the Python transformers library, so if you know of a model where you can do this, we can certainly look into it! Is something like https://huggingface.co/speechbrain/sepformer-wham what you're looking for?

xenova avatar Jun 03 '24 11:06 xenova

Yes, speechbrain looks good, but I think demucs would be the best. However, I'm a beginner in machine learning and Python, so I'm looking for the simplest solution that can achieve my goal

asasas234 avatar Jun 03 '24 12:06 asasas234

Hi there πŸ‘‹ This library serves as a JavaScript port of the Python transformers library, so if you know of a model where you can do this, we can certainly look into it! Is something like https://huggingface.co/speechbrain/sepformer-wham what you're looking for?

ζˆ‘δΉŸιœ€θ¦

grasilife avatar Dec 26 '24 06:12 grasilife