SoniTranslate icon indicating copy to clipboard operation
SoniTranslate copied to clipboard

Include option to add vocabulary for better transcription

Open Jalaj-KT opened this issue 1 year ago • 1 comments

Problem statement

This pull request addresses the issue of suboptimal transcription quality when using Whisper for video dubbing. Users may experience bad transcription even after using large size models (for uncommon words like vishing, phising etc.)

Solution

The proposed solution introduces an option for users to input custom vocabulary. This enhancement aims to improve transcription accuracy, even with smaller models, by allowing specific words to be recognized more accurately.

Dependencies

No dependencies added.

Jalaj-KT avatar Aug 22 '24 06:08 Jalaj-KT

code change looks good, wish this can be merged @R3gm

agung2001 avatar Mar 01 '25 08:03 agung2001