OpenAdapt Implement audio recording for narration in record.py

Open • abrichr opened this issue 1 year ago • 4 comments

We would like to let the user explain what they are doing as they do it when creating a recording.

This is a replacement for https://github.com/MLDSAI/OpenAdapt/issues/57 , which is proving to be more difficult than we would like.

May 19 '23 18:05 abrichr

Do we want the recording to be captured, transcribed (e.g. with Whisper), and then mixed into the inputs for LLM completions somehow?

May 20 '23 04:05 FFFiend

@FFFiend exactly right!

May 21 '23 04:05 abrichr

Awesome, I'm just confused about one thing: once the voice is captured, how exactly do we want to transform the data? Perhaps a 1-1 mapping between Events and the actual content of the user's speech? I couldn't see the reasoning behind generating completions for audio when that's double the work we're already doing. So is this more to increase the accuracy of completions (which is where the mapping comes in)? 😮

May 23 '23 00:05 FFFiend

@FFFiend good question. The idea here is to include the transcription in the model prompt to provide context to the model for what is important.
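
A minimal sketch of what that could look like, assuming the narration has already been transcribed into timestamped segments (for example by a speech-to-text model). `ActionEvent`, `TranscriptSegment`, `narration_for`, and `build_prompt` are hypothetical names used only for illustration; they are not OpenAdapt's actual models or API, and the prompt wording is a placeholder.

```python
from dataclasses import dataclass


@dataclass
class ActionEvent:
    # Hypothetical stand-in for a recorded input event; not OpenAdapt's model.
    timestamp: float  # seconds since the recording started
    description: str


@dataclass
class TranscriptSegment:
    # Hypothetical shape of one timestamped speech-to-text segment.
    start: float
    end: float
    text: str


def narration_for(event, segments):
    """Return the text of the first segment whose time span contains the event, else ''."""
    for seg in segments:
        if seg.start <= event.timestamp <= seg.end:
            return seg.text.strip()
    return ""


def build_prompt(events, segments):
    """List each event, appending any narration spoken at the same time."""
    lines = ["The user narrated while performing these actions:"]
    for event in events:
        line = f"- [{event.timestamp:.1f}s] {event.description}"
        said = narration_for(event, segments)
        if said:
            line += f' (user said: "{said}")'
        lines.append(line)
    lines.append("Use the narration to decide which actions matter most.")
    return "\n".join(lines)


events = [
    ActionEvent(1.2, "click 'File' menu"),
    ActionEvent(6.5, "type 'report.csv'"),
]
segments = [TranscriptSegment(0.0, 3.0, " First I open the file menu. ")]
prompt = build_prompt(events, segments)
print(prompt)
```

Matching by timestamp overlap is only one option; events with no overlapping narration are still listed, just without the extra context.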

Related: https://github.com/OpenAdaptAI/OpenAdapt/issues/564

Mar 07 '24 20:03 abrichr
