Meshtastic-Android icon indicating copy to clipboard operation
Meshtastic-Android copied to clipboard

TTS support

Open DeadlyDad opened this issue 4 years ago • 10 comments

It just struck me this morning (while pondering another project) that one way to transmit voice over LORA would be to use cellphone apps on either end to do speech to text on one end, send the text, then do text to speech on the other. (Might it be possible for individuals to set up phoneme databases so that the transmitted voice would be theirs?) One further advantage would be a transcript of the conversation. (...and a lot of laughter from autocorrect hijinx. :grin: )

Does that make sense?

DeadlyDad avatar Mar 27 '20 22:03 DeadlyDad

What's the ~~max/avg~~ realistic bandwidth for LORA?

jeksys avatar Mar 27 '20 22:03 jeksys

Alas I think this is not a good for for Lora. Because when configured for long range, the number of chirps per bit becomes quite long.

I.e. sending a 100 byte packet takes a couple of seconds.

And there are maximum duty cycle limits for transmitters for most regulators. For our 'occasional' gps and text messages no problem, but if you squirt many packets quickly you could have problems.

(Sent from a phone - please ignore typos)

On Fri, Mar 27, 2020, 15:23 Eugene Yagrushkin [email protected] wrote:

What's the max/avg bandwidth for LORA?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/meshtastic/Meshtastic-esp32/issues/66#issuecomment-605339774, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABXB2JT2WH4BYE4RCASTDDRJURPRANCNFSM4LVKNCPA .

geeksville avatar Mar 27 '20 22:03 geeksville

What's the ~max/avg~ realistic bandwidth for LORA?

I'm not sure why you are asking me that, but there isn't a simple answer: https://medium.com/home-wireless/testing-lora-radios-with-the-limesdr-mini-part-2-37fa481217ff

If you want to know more, I invite you to do some research on your own. (I'm not a LORA expert.)

DeadlyDad avatar Mar 27 '20 22:03 DeadlyDad

Oh I see. You mean speech to text and text to speech then send characters. Yes, that would be possible. Either with a separate app that uses service we have running on the phone or by sending in a PR for the app itself.

(Sent from a phone - please ignore typos)

On Fri, Mar 27, 2020, 15:42 Kevin Hester [email protected] wrote:

Alas I think this is not a good for for Lora. Because when configured for long range, the number of chirps per bit becomes quite long.

I.e. sending a 100 byte packet takes a couple of seconds.

And there are maximum duty cycle limits for transmitters for most regulators. For our 'occasional' gps and text messages no problem, but if you squirt many packets quickly you could have problems.

(Sent from a phone - please ignore typos)

On Fri, Mar 27, 2020, 15:23 Eugene Yagrushkin [email protected] wrote:

What's the max/avg bandwidth for LORA?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/meshtastic/Meshtastic-esp32/issues/66#issuecomment-605339774, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABXB2JT2WH4BYE4RCASTDDRJURPRANCNFSM4LVKNCPA .

geeksville avatar Mar 27 '20 22:03 geeksville

Thanks for the reply. I suppose 400bps or less would be workable if communications were kept short and folks understood the limitations. (...and if you could figure out how to get a fairly reliable 700bps, you could have actual VOIP. ...ish: https://hackaday.com/2017/01/20/voice-at-700-bits-per-second )

DeadlyDad avatar Mar 27 '20 22:03 DeadlyDad

@DeadlyDad thank you for detailed answer :)

100 byte packet per couple of seconds is not a lot.

Most of speech recognition frameworks require network connection. For iOS: "On-device speech recognition is available for some languages, but the framework also relies on Apple’s servers for speech recognition. Always assume that performing speech recognition requires a network connection." https://developer.apple.com/documentation/speech

There are frameworks which doesn't require connection. OpenEars is one of them, though I'm not sure if it's good enough.

jeksys avatar Mar 27 '20 22:03 jeksys

agree. I'm going to move this into the "good idea which we would happily accept PRs for, but won't have time to ponder ourselves" queue.

geeksville avatar Mar 28 '20 00:03 geeksville

(While some may require an internet connection, I was using voice dictation software on a Pentium running Windows 95 before the internet was even available where I lived, and there are a number of projects that don't require a network connection. Here, for example, are five of them: https://fosspost.org/alternative-software/developers/open-source-speech-recognition-speech-to-text )

DeadlyDad avatar Mar 28 '20 09:03 DeadlyDad

This issue has been mentioned on Meshtastic. There might be relevant details there:

https://meshtastic.discourse.group/t/ptt-voice-transmission/356/6

geeksville avatar Jun 12 '20 08:06 geeksville

Moved to the android repository. The request is to use the local device speech to text and text to speech.

mc-hamster avatar Mar 27 '21 05:03 mc-hamster

this now can be done without the need for in-app support.

andrekir avatar Oct 01 '22 13:10 andrekir