DeepSpeech icon indicating copy to clipboard operation
DeepSpeech copied to clipboard

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Results 133 DeepSpeech issues
Sort by recently updated
recently updated
newest added

Further to discussion [here](https://discourse.mozilla.org/t/discuss-potential-pr-related-to-generate-lm-py/75883?u=nmstoker), this is my PR for enabling **generate_lm.py** to accept multiple input texts which are combined into a single lm.binary output for onward creation of a scorer....

Both versions are installed from the npm package. TensorFlow: v2.3.0-6-g23ad988 DeepSpeech: v0.9.3-0-gf2e9c85 Tried in both these environments: * Windows 10.0.19042 / Python 3.9.0 * WSL2 Debian 10 / Python 3.7.3...

Solves #3502 - Added standalone Java bindings, that do not rely on Android - Script to produce and collect all required files for usage in the end users own projects...

Now that we masterize better the build-and-cache aspect of TaskCluster, it might be a good time to re-think the interfacing for the known-to-be-painful build dance between TensorFlow and TaskCluster: -...

Will you add the .Net Standard library into nuget package?

help wanted
good first bug

TaskCluster is a CI service provided by Mozilla, and available to both Firefox development (Firefox-CI instance) and Community on Github (Community TaskCluster). It’s being widely used across some Mozilla projects,...

help wanted
ci

This issue is for discussing dropping support for the full TensorFlow runtime on macOS. One recent issue where the difficulty of supporting full TensorFlow on macOS has come up is...

help wanted

``` + ls -halR DeepSpeech.0.10.0-alpha.3 DeepSpeech.0.10.0-alpha.3: total 7.7M drwxr-xr-x 1 task_161425170976582 None 0 Feb 25 11:17 . drwxr-xr-x 1 task_161425170976582 None 0 Feb 25 11:17 .. -rw-r--r-- 1 task_161425170976582 None...

bug
help wanted
branch:master
ci
Windows

It'd make things easier and more performant when using this project with some game engines which provide audio data as float samples, instead of int16. I've looked into the source...