
Pre-process input audio to just use vocals and remove background noise

Open · kurianbenoy opened this issue 11 months ago · 1 comment

Is your feature request related to a problem? Please describe.

Most ASR models are trained on clean audio with minimal background noise. One good way to reduce the error rate is to isolate the vocals from the input audio and transcribe only those.

Describe the solution you'd like

Use Demucs to separate the vocals from the input audio before running ASR:

  • https://github.com/facebookresearch/demucs
  • https://github.com/xserrat/docker-facebook-demucs
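
A rough sketch of how this pre-processing step could look, assuming the `demucs` command-line tool is installed and on the PATH. The helper names (`build_demucs_command`, `expected_vocals_path`, `extract_vocals`), the `htdemucs` model choice, and the `separated` output folder are illustrative assumptions, not part of Indic-Subtitler. The output location follows Demucs' default `<out>/<model>/<track>/vocals.wav` layout.

```python
import subprocess
from pathlib import Path

# Assumption: the default Demucs v4 model; any other installed model name could be used.
MODEL_NAME = "htdemucs"


def build_demucs_command(audio_path, out_dir, model=MODEL_NAME):
    """Build the demucs CLI call that splits a file into vocals / no_vocals."""
    return [
        "demucs",
        "--two-stems=vocals",  # produce only vocals.wav and no_vocals.wav
        "-n", model,           # which pretrained model to use
        "-o", str(out_dir),    # root folder for the separated tracks
        str(audio_path),
    ]


def expected_vocals_path(audio_path, out_dir, model=MODEL_NAME):
    """Where Demucs writes the vocal stem with its default filename layout."""
    return Path(out_dir) / model / Path(audio_path).stem / "vocals.wav"


def extract_vocals(audio_path, out_dir="separated", model=MODEL_NAME):
    """Hypothetical helper: run Demucs and return the path to the vocal stem."""
    subprocess.run(build_demucs_command(audio_path, out_dir, model), check=True)
    vocals = expected_vocals_path(audio_path, out_dir, model)
    if not vocals.exists():
        raise FileNotFoundError(f"Demucs did not produce {vocals}")
    return vocals
```

The ASR model would then be given the path returned by `extract_vocals("input.mp3")` instead of the original file. Whether this actually lowers the error rate for a given model and language would need to be measured.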

kurianbenoy, Feb 25 '24 18:02