
Audio file vocal synthesis INPUT cannot exceed a certain length

Open andrew-fennell opened this issue 2 years ago • 0 comments

Problem

Audio files longer than a minute do not work as vocal synthesis text input (that is, when you supply an audio file to "copy" the words from, rather than providing the text directly).

Error: image

Proposed solution

  • Cut the provided audio into segments
  • Transcribe each audio segment
  • Combine transcriptions
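
The three steps above could be sketched roughly as follows. This is only an illustration, not CogNative's actual code: `split_samples`, `transcribe_long_audio`, and the 30-second default are hypothetical, and `transcribe` is a stand-in for whatever speech-to-text function the project uses.

```python
from typing import Callable, List, Sequence


def split_samples(samples: Sequence[float], sample_rate: int,
                  segment_seconds: float = 30.0) -> List[Sequence[float]]:
    """Cut raw audio samples into consecutive fixed-length segments."""
    if sample_rate <= 0 or segment_seconds <= 0:
        raise ValueError("sample_rate and segment_seconds must be positive")
    step = max(1, int(sample_rate * segment_seconds))
    return [samples[i:i + step] for i in range(0, len(samples), step)]


def transcribe_long_audio(samples: Sequence[float], sample_rate: int,
                          transcribe: Callable[[Sequence[float], int], str],
                          segment_seconds: float = 30.0) -> str:
    """Transcribe each segment separately and join the results in order.

    `transcribe` is a placeholder (assumption) for the real
    speech-to-text call; it receives one segment and the sample rate.
    """
    segments = split_samples(samples, sample_rate, segment_seconds)
    texts = [transcribe(segment, sample_rate).strip() for segment in segments]
    # Drop empty results (e.g. silent segments) before combining.
    return " ".join(text for text in texts if text)
```

Passing the transcriber in as a callable keeps the segmenting logic independent of the speech-to-text backend, so it can be tested with a fake function.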

Fixed-length cuts could split words and sentences at segment boundaries, which would decrease the quality of the transcriptions.
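
One possible way to reduce mid-word cuts is to move each cut to the quietest spot near the intended boundary, on the assumption that low energy usually means a pause between words. The sketch below is not part of the proposal itself; `quietest_cut`, `cut_points`, and their parameters (`segment_len`, `search`, `window`, all measured in samples) are hypothetical names chosen for illustration.

```python
from typing import List, Sequence


def quietest_cut(samples: Sequence[float], target: int,
                 search: int, window: int) -> int:
    """Return an index near `target` where a short window has the least energy."""
    lo = max(0, target - search)
    hi = min(len(samples) - window, target + search)
    if hi < lo:
        # Not enough audio to search; fall back to the plain cut.
        return min(target, len(samples))
    best_start, best_energy = lo, float("inf")
    for start in range(lo, hi + 1):
        energy = sum(s * s for s in samples[start:start + window])
        if energy < best_energy:
            best_start, best_energy = start, energy
    # Cut in the middle of the quietest window.
    return best_start + window // 2


def cut_points(samples: Sequence[float], segment_len: int,
               search: int, window: int) -> List[int]:
    """Choose cut indices roughly every `segment_len` samples, nudged toward quiet spots."""
    cuts: List[int] = []
    pos = 0
    while len(samples) - pos > segment_len:
        cut = quietest_cut(samples, pos + segment_len, search, window)
        cut = max(cut, pos + 1)  # guarantee forward progress
        cuts.append(cut)
        pos = cut
    return cuts
```

This brute-force energy scan is slow on long recordings and does nothing for speech without pauses, so it is a starting point rather than a complete fix.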

andrew-fennell • Apr 18 '22 00:04