[FEATURE]: Speech-to-Text Voice Input for Lazy People in OpenCode

Open Fuzu opened this issue 1 month ago • 6 comments

Feature hasn't been suggested before.

[x] I have verified this feature I'm about to request hasn't been suggested before.

Describe the enhancement you want to request

Hi! First of all, congratulations on the amazing project.

I've been working on a Speech-to-Text voice input feature that integrates directly into the TUI. It allows users to start audio recording with a keybind, automatically transcribe speech using different providers, and insert the resulting text directly into the prompt.

I've built an initial working version, currently tested only on macOS, and the system includes:

Real-time audio recording via FFmpeg;
Support for Groq Whisper, OpenAI Whisper, and local whisper.cpp;
Automatic microphone/device detection;
Interactive menus for choosing provider, model, and audio device;
Persistent configuration stored in ~/.opencode/state/speech.json;
Customizable keybinds (Ctrl+X v, Ctrl+X P, Ctrl+X D);
Smooth flow: record → transcribe → insert into prompt input;

Would this be something you'd be interested in integrating into the project?

Nov 24 '25 18:11 Fuzu