
How to fine-tune a Whisper model with 'initial_prompt'

Open · ben-8878 opened this issue · 3 comments

When I use 'initial_prompt', the decoding results of the Whisper v2 model fine-tuned on my data are bad; without it, the results are good. However, with 'initial_prompt' the decoding results of the base Whisper v2 model are also good. So does this mean that if I want to use 'initial_prompt' during decoding, I must also add it during training?

ben-8878 · May 06 '24 06:05

Sorry, I don't understand your issue. Could you please explain in more detail what you want to achieve and how? Ideally, show the code that leads to the good or bad results.

BenjaminBossan · May 06 '24 09:05

Hi. Whisper can use context information to improve recognition accuracy. To pass context information to Whisper at decoding time, there is a CLI argument: https://github.com/openai/whisper/blob/main/whisper/transcribe.py#L531

    parser.add_argument("--initial_prompt", type=str, default=None, help="optional text to provide as a prompt for the first window.")

The problem: when the model is fine-tuned without "--initial_prompt", the decoding results of the fine-tuned model get worse as soon as "--initial_prompt" is used.
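For reference, the equivalent of that flag in the Python API looks roughly like this (a minimal sketch; "audio.wav" and the prompt text are placeholders for my data):

    # Decoding with a prompt via the openai-whisper Python API; transcribe()
    # prepends the prompt as previous-window context before decoding.
    import whisper

    model = whisper.load_model("large-v2")
    result = model.transcribe("audio.wav", initial_prompt="domain-specific context")
    print(result["text"])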

ben-8878 · May 06 '24 10:05

I see. I don't really have any expertise in Whisper and how the initial prompt affects the outcome. But my best guess is that yes, if you want to use it at inference time, you should also use it during training, following the same logic as in the script you linked.
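A rough, untested sketch of what that could look like, assuming the Hugging Face WhisperTokenizer (its get_prompt_ids helper builds the same <|startofprev|> prefix as the linked script; the prompt and transcript strings below are placeholders):

    # Build decoder inputs/labels so the model learns to condition on a prompt.
    # The loss is masked (-100) on the prompt positions, so only the transcript
    # tokens contribute to training.
    from transformers import WhisperTokenizer

    tokenizer = WhisperTokenizer.from_pretrained("openai/whisper-large-v2")

    prompt_ids = tokenizer.get_prompt_ids("domain-specific context", return_tensors="np").tolist()
    target_ids = tokenizer("the ground-truth transcript").input_ids

    full = prompt_ids + target_ids
    decoder_input_ids = full[:-1]  # teacher forcing: decoder inputs are shifted right
    labels = full[1:]
    labels[: len(prompt_ids) - 1] = [-100] * (len(prompt_ids) - 1)  # ignore prompt tokens in the loss

You would then pass decoder_input_ids and labels explicitly in your data collator, rather than letting the trainer derive the decoder inputs by shifting the labels (which would replace the masked prompt positions with padding).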

BenjaminBossan · May 06 '24 10:05

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

github-actions[bot] · Jun 05 '24 15:06