Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Test frameworks for transcription and diarization of YouTube videos

Open finitearth opened this issue 2 years ago • 4 comments

Youtube Videos can be a good source of dialogs between interviewers and experts, although the dialog quality of the videos chosen should be considered when choosing which videos could help fine tuning.

Frameworks of consideration could be OpenAIs Whisper for transcription and pyannote for diarization (identification for when speaker switch)

Would be happy to take on this task! :)

finitearth avatar Jan 06 '23 15:01 finitearth

it would be cool if after transcripts and diarisation same questions can be matched up so that the different responses can be ranked. i.e. Lex's podcast tends to ask guests the same questions to many different people.

danielpatrickhug avatar Jan 06 '23 17:01 danielpatrickhug

@finitearth ok, this idea was mentioned a few times. I think it is very interesting, please go ahead with your tests and report back .. maybe create an ipynb or some scripts if possible.

andreaskoepf avatar Jan 06 '23 19:01 andreaskoepf

and a readme please :)

huu4ontocord avatar Jan 06 '23 20:01 huu4ontocord

@finitearth wanted to check on status of this issue.

huu4ontocord avatar Jan 22 '23 03:01 huu4ontocord

Closing old data issue.

andreaskoepf avatar Jun 14 '23 08:06 andreaskoepf