narrator icon indicating copy to clipboard operation
narrator copied to clipboard

[PhotoBooth] mode with image persistence along side audio file. Also, streaming capabilities, setup script, readme updates, and narrator prompt update

Open rsmets opened this issue 2 years ago • 3 comments

Summary

  • Added a setup.sh script to facilitate getting the two shell envs setup to run capture.py and narrator.py.Also, updated README with setup.sh usage info. From https://github.com/cbh123/narrator/pull/32
  • Updated to the narrator GPT prompt to explicitly have it describe the image as David Attenborough for increased complex descriptors (and humor). From https://github.com/cbh123/narrator/pull/33
  • Adds the ability to stream the Eleven Labs text to voice which creates a more immediate experience. From https://github.com/cbh123/narrator/pull/34.
  • Added dotenv capabilities, including .env.example file and README updates. Based on https://github.com/cbh123/narrator/pull/30/files.
  • Format linting
  • Added photo booth mode, which only analyzes the image when the space bar is pressed.
  • Image file saved along side the corresponding audio file.

rsmets avatar Nov 24 '23 18:11 rsmets

This looks great! thank you! Will take a look tonight or tomorrow

cbh123 avatar Nov 27 '23 23:11 cbh123

very nice contribution!

eroncanc avatar Dec 09 '23 17:12 eroncanc

Great PR! Let's see it merged :D

chinciusan avatar Dec 21 '23 15:12 chinciusan