Voice-Recorder icon indicating copy to clipboard operation
Voice-Recorder copied to clipboard

Offline, privacy-respecting speech to text

Open RustoMCSpit opened this issue 1 year ago • 10 comments
trafficstars

Checklist

  • [X] I made sure that there are no existing issues - open or closed - to which I could contribute my information.
  • [X] I made sure that there are no existing discussions - open or closed - to which I could contribute my information.
  • [X] I have read the FAQs inside the app (Menu -> About -> FAQs) and my problem isn't listed.
  • [X] I have taken the time to fill in all the required details. I understand that the bug report will be dismissed otherwise.
  • [X] This issue contains only one feature request.
  • [X] I have read and understood the contribution guidelines.
  • [ ] I optionally donated to support the Fossify mission.

Feature description

Speech-to-text transcription of audios that recognises multiple speakers. Able to see text of any audio by dropdown, or search bar, and exporting of all trascribed text as well.

Why do you want this feature?

would also be able to allow for a transcript so you could have a search bar and go through your voice recordings and you could click through the exact moment that word was said in the voice recordings. so if i typed 'adam' it may find 4 hits from the past 4 months: file191: 00:07 file179: 12:23, 16:30 file73: 06:42

you could then click on those moments to find the one youre looking for.

this could also be used for tagging, for example, if im working on a project called 'block runner' i could search for all mentions and tag them all easily

Additional information

Futo has partially delivered on this with an excellent FOSS solution: https://gitlab.futo.org/alex/voiceinput https://voiceinput.futo.org/

But the Futo solution currently works within other apps only and is not integrated directly into a voice recorder app. Adding Futo's speech-to-text capabilities to Simple Voice Recorder would make a voice recorded easily on par with Google's proprietary app.

RustoMCSpit avatar Feb 26 '24 16:02 RustoMCSpit

https://github.com/FossifyOrg/Voice-Recorder/issues/17

RustoMCSpit avatar Feb 26 '24 16:02 RustoMCSpit

+1

Warden20 avatar May 17 '24 05:05 Warden20

Also looking for something like this. Lots of proprietary apps but no FLOSS ones.

satvikpendem avatar Jan 20 '25 09:01 satvikpendem

Though it will land an AF on fdroid, it might be easy to copy from Whisper. https://github.com/woheller69/whisperIME

endingisnight avatar Jan 25 '25 02:01 endingisnight

Please integrate the FUTO thingy, would love this feature!

PinguDEV-original avatar Mar 10 '25 08:03 PinguDEV-original