Added example Podcast_and_Audio_Transcription

Open SonjeVilas opened this issue 9 months ago • 7 comments

Adds automated audio transcription using Gemini 2.0 with:

✅ Speaker identification (labeled or as Speaker A/B) ✅ Precision timestamps ([HH:MM:SS]) ✅ Music/sound effect detection (e.g., [Jingle] or [Song Name]) ✅ Clean text output with [END] marker

Testing: Verified with podcasts & call recordings.
Deps: jinja2, Gemini API client.

Useful for podcasts, interviews, and call analysis.

Apr 04 '25 09:04 SonjeVilas

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

Apr 04 '25 09:04 review-notebook-app[bot]

Thanks @SonjeVilas, that's an interesting example. I won't have time to review it today but I'll try to do it next week.

Apr 04 '25 13:04 Giom-V

@Giom-V Thanks For the Review... :)

Apr 20 '25 10:04 SonjeVilas

@nikitamaia Thanks for Review :)

Apr 29 '25 09:04 SonjeVilas

Hello @SonjeVilas, do you still want to push that example?

Jun 04 '25 20:06 Giom-V

Thanks for reminder ! I will complete this PR on this weekend.

Jun 05 '25 03:06 SonjeVilas

Done!

Jun 15 '25 07:06 SonjeVilas