Kokoro-FastAPI icon indicating copy to clipboard operation
Kokoro-FastAPI copied to clipboard

Feature Request: Support Silence Tags in the text

Open fondoger opened this issue 9 months ago • 2 comments

Kokoro already have phonemes tags feture. Eg: [Kokoro](/kˈOkəɹO/) is an open-weight TTS model.

I want a new feature called silence tags. Eg: Hello. [1s] Nice to meet you.

This silence tags are processed in FastAPI instead of the Kokoro model. For example, if we find an [1s] silence tag, we can add a silence audio frame with duration 1 second between the audio Hello. and Nice to meet you.

Existing Sample See: https://voice-generator.pages.dev Image

fondoger avatar Mar 31 '25 10:03 fondoger

Already working, see https://github.com/remsky/Kokoro-FastAPI/issues/161

Patrick-Ric avatar Apr 28 '25 05:04 Patrick-Ric