gexgd0419 comments

Results 114 comments of


                                            gexgd0419

Any intention to make the use of embedded speech to text feature?

The keys are for the models that will be installed when you go to Windows Settings > Time & language > Language & region, open the language option for a...

Any intention to make the use of embedded speech to text feature?

@jinlife You should search your installed "speech pack" in the Library page, instead of searching in the top search bar. After that, click share, copy link. Anyway, here's the link...

offline voices not showing

A new version [v0.2](https://github.com/gexgd0419/NaturalVoiceSAPIAdapter/releases/tag/v0.2) has been released. You can check if the problem still exists in this version. > mark nowhere to be found This is normal. The TtsApplication supports...

offline voices not showing

Version [v0.2.2](https://github.com/gexgd0419/NaturalVoiceSAPIAdapter/releases/tag/v0.2.2) has been released. @Wazard Does this issue still exist?

natural voice sapi adapter api key

A new version [v0.2](https://github.com/gexgd0419/NaturalVoiceSAPIAdapter/releases/tag/v0.2) has been released! Now there's support for enumerating Azure voices, and a UI for entering the API key.

natural voice sapi adapter api key

The voice list is fetched from the server, so you can use all the voices available to your Azure subscription. To avoid blocking the client when enumerating the voices, the...

能否提高离线讲述人自然语音的响应速度

我用程序测试了一下，几种不同情况下，从按下键盘按键到读屏软件输出音频的延迟。其中，NVDA 版本为 2024.1，系统为 Win 11 23H2，所有语音都调整到了最大速度。测试方法是输入A到Z的字母，然后取每次延迟的平均值。延迟的计算方法是，使用程序监测键盘和回录系统声音，记录键盘按下的时间点、音频数据变为非零的时间点（即输出音频延迟）、音频数据大于0.0001的时间点（即音频可听见延迟）。 | 读屏软件 | 语音 | 输出音频延迟 | 音频可听见延迟 | | - | - | - | - | | NVDA |...

能否提高离线讲述人自然语音的响应速度

首先要注意一点，即使是离线语音，自然语音也比普通语音的延迟更高。因为自然语音看起来是使用了 AI 模型运算的，使用了 onnxruntime 组件，也会有一定的 CPU 占用。可以把“讲述人 Xiaoxiao (内置功能)”的延迟作为基准，如果讲述人内置的自然语音功能的延迟可以接受的话，或许还有优化空间。 OneCore 接口的问题在于，微软并没有提供“如何编写第三方 OneCore 语音”的文档，反而在 OneCore 语音的 [SpeechSynthesizer 文档](https://learn.microsoft.com/en-us/uwp/api/windows.media.speechsynthesis.speechsynthesizer#remarks)里指出 OneCore 语音必须有微软的签名。 > Only Microsoft-signed voices installed on the system can be used...

能否提高离线讲述人自然语音的响应速度

如果想要减小读屏延迟，最好的方法可能是使用直接与读屏软件对接的插件。NVDA 实际上已经有了微软离线自然语音的插件，名叫 [NeuralVoices](https://datajake.braillescreen.net/tts/synthesizers%20for%20nvda/neuralVoices-1.0.nvda-addon)，而且里面自带所有离线语音包。目前还没测试过它的延迟水平如何。在 #18 里也有人建议我出一个 NVDA 插件，因为 NVDA 不支持 SAPI5 语音的自动语言切换，并且这个 NeuralVoices 也不支持自动语言切换。我暂时没有出 NVDA 插件的计划，不过给 NVDA 提了 PR 修了 SAPI5 的语言切换问题。SAPI5 更主要的是通用性，不只限于读屏软件，任何需要 TTS 的程序都可以使用。这个项目本身只是一个将 SAPI5 对接到 Azure Speech...

does ssml language work for multilingual voices?

Unfortunately, Microsoft Edge online voices only support a very limited subset of SSML. `` tags are not supported. Also, any unsupported SSML tag will make the server throw an "SSML...