transformers.js Is it possible to use transformers.js to implement audio source separation tasks?

Is it possible to use transformers.js to implement audio source separation tasks?

Open asasas234 opened this issue 1 year ago • 2 comments

Question

Hello, I have a beginner's question.

I want to implement the task of removing the human voice from the audio in the video and retaining the background sound in the browser. The idea is to load the model for audio source separation related to transformers.js to achieve the separation of the background sound and human voice, and then only return the background sound.

But I couldn't find relevant examples in the documentation, so I was wondering if this can be implemented? If so, what are the learning or research paths?

Looking forward to your reply

Jun 02 '24 04:06 asasas234

Hi there 👋 This library serves as a JavaScript port of the Python transformers library, so if you know of a model where you can do this, we can certainly look into it! Is something like https://huggingface.co/speechbrain/sepformer-wham what you're looking for?

Jun 03 '24 11:06 xenova

Yes, speechbrain looks good, but I think demucs would be the best. However, I'm a beginner in machine learning and Python, so I'm looking for the simplest solution that can achieve my goal

Jun 03 '24 12:06 asasas234

Hi there 👋 This library serves as a JavaScript port of the Python transformers library, so if you know of a model where you can do this, we can certainly look into it! Is something like https://huggingface.co/speechbrain/sepformer-wham what you're looking for?

我也需要

Dec 26 '24 06:12 grasilife

transformers.js transformers.js copied to clipboard

Is it possible to use transformers.js to implement audio source separation tasks?

Question

transformers.js
transformers.js copied to clipboard