Rotem Dan
Rotem Dan
Currently, running the server (`echogarden serve`) and opening the local host HTTP page (`http://localhost:45054`) shows a basic placeholder message (`"This is the Echogarden HTTP server!"`) Gradually, start developing a graphical...
The browser extension, currently in development, already has the core functionality of communicating with the server, full word highlighting, and being able to speak starting at a selected page element....
It is technically possible, overall, since the core components: `espeak-ng` and `onnxruntime` both fully support running in the browser. Actually, `onnxruntime-web`, unlike `onnxruntime-node` (the currently used package), can also make...
The current two engines (`tinyld` and `fasttext`) aren't always accurate and sometime produce odd or nonsensical classifications, like classifying English text as Klingon. I've developed a custom engine, based on...
Hi, I have a suggestion. When using the transcript alignment function, could the program try to split sentences at commas and periods as much as possible? Sometimes a sentence doesn't...
OpenAI provides a subscription-based cloud service that is able to transcribe speech using the largest Whisper model (`large-v2`): ``` https://api.openai.com/v1/audio/transcriptions ``` And translate speech using the same model: ``` https://api.openai.com/v1/audio/translations...
In the VITS and eSpeak engines, the text is converted to phonemes using the phoneme events produced by the eSpeak speech synthesizer during synthesis. eSpeak does a reasonable job in...
For example, when the default English voice (Amy / Low) gets an utterance that is a single word, like "two", it seems to mispronounce it as something that sounds closer...
Not very easy to implement at the moment. May require significant changes in many source files.
When a Wikipedia article URL like `https://en.wikipedia.org/wiki/Garden` is given to `speak-url` or `speak-wikipedia`, detect the article's language from the URL, and use the Wikipedia parsing package to get plain text...