
Added example Podcast_and_Audio_Transcription

Open SonjeVilas opened this issue 9 months ago • 7 comments

Adds automated audio transcription using Gemini 2.0 with:

✅ Speaker identification (labeled or as Speaker A/B)
✅ Precision timestamps ([HH:MM:SS])
✅ Music/sound effect detection (e.g., [Jingle] or [Song Name])
✅ Clean text output with [END] marker

  • Testing: Verified with podcasts & call recordings.
  • Deps: jinja2, Gemini API client.

Useful for podcasts, interviews, and call analysis.
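
For illustration, here is a minimal sketch of how a transcript in the format described above might be parsed downstream. It is not taken from the notebook: the exact line shape (`[HH:MM:SS] Speaker A: text`, with sound events as `[HH:MM:SS] [Jingle]`) is an assumption, and `Segment` and `parse_transcript` are hypothetical names. It uses only the standard library and does not call the Gemini API.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumed line shape: "[HH:MM:SS] <rest>". This is a guess at the output
# format based on the PR description, not the notebook's actual output.
LINE_RE = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s*(.*)$")
# A bracketed remainder such as "[Jingle]" is treated as a sound/music event.
EVENT_RE = re.compile(r"^\[([^\]]+)\]$")


@dataclass
class Segment:
    seconds: int                # timestamp converted to seconds
    speaker: Optional[str]      # None for sound events or unlabeled lines
    text: str
    is_sound_event: bool = False


def parse_transcript(raw: str) -> List[Segment]:
    """Parse transcript text into segments, stopping at the [END] marker."""
    segments: List[Segment] = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        if line == "[END]":
            break  # everything after the marker is ignored
        match = LINE_RE.match(line)
        if not match:
            continue  # skip lines without a leading timestamp
        hours, minutes, secs, rest = match.groups()
        seconds = int(hours) * 3600 + int(minutes) * 60 + int(secs)

        event = EVENT_RE.match(rest)
        if event:
            segments.append(Segment(seconds, None, event.group(1), True))
            continue

        # Split on the first colon only, so colons inside the spoken text
        # survive. A line with no speaker label but a colon in its text
        # would be misread here; acceptable for a sketch.
        speaker, sep, text = rest.partition(":")
        if sep:
            segments.append(Segment(seconds, speaker.strip(), text.strip()))
        else:
            segments.append(Segment(seconds, None, rest))
    return segments
```

A parser like this would let the speaker labels and timestamps be checked or re-exported (for example, to subtitles) without relying on the model's raw text layout.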

SonjeVilas avatar Apr 04 '25 09:04 SonjeVilas


Thanks @SonjeVilas, that's an interesting example. I won't have time to review it today but I'll try to do it next week.

Giom-V avatar Apr 04 '25 13:04 Giom-V

@Giom-V Thanks for the review :)

SonjeVilas avatar Apr 20 '25 10:04 SonjeVilas

@nikitamaia Thanks for the review :)

SonjeVilas avatar Apr 29 '25 09:04 SonjeVilas

Hello @SonjeVilas, do you still want to push that example?

Giom-V avatar Jun 04 '25 20:06 Giom-V

Thanks for the reminder! I will complete this PR this weekend.

SonjeVilas avatar Jun 05 '25 03:06 SonjeVilas

Done!

SonjeVilas avatar Jun 15 '25 07:06 SonjeVilas