STT issues

Results 114 STT issues

Sort by recently updated

Feature request: Publish Android artifact to Maven

Can reference the PR from DeepSpeech: https://github.com/mozilla/DeepSpeech/pull/1848

xizzhu

enhancement

Re-Architect Training Pipeline

kdavis-coqui

Feature request: Integrated Voice Activity Detection

Quality Voice Activity Detection as part of the API would be very handy. Real time STT in general can be processed these ways: -Continuous inference. -Push To Talk (PTT). -Voice...

BitBarrel

enhancement

Feature request: option to only allow sentences which occur in the language model

An option to only allow sentences which occur in the language model would be very helpful. For example, the language model contains this: flap one gear down Currently STT can...

BitBarrel

enhancement

Feature request: .Net C# streaming VAD on Windows example project

Once the .NET package on NuGet for Windows (Visual Studio) is available again, it would be nice to have a streaming VAD example project available.

BitBarrel

enhancement

Feature request: add text to docs about Opus, not just Wav

right now, the docs only reference training on WAV files, but we can also train on opus. similarly, we should probably rename the expected column headers in the csv data...

JRMeyer

enhancement

Feature request: Docs: add section on how to chose batch size

OOM errors happen all the time in training for newcomers and pros... we should have a simple guide on using `--reverse_{train,test,dev}` and steps to choose the right batchsize on some...

JRMeyer

enhancement

Bug: duplicate command-line training flags silently parsed

Current behavior: If a flag is duplicated at the CLI and passed to `train.py`, there is no warning or error message, and the last setting is saved. This was first...

JRMeyer

bug

Feature request: replace progressbar2 with tqdm

right now we use both `progressbar2` and `tqdm`, but the latter is better maintained and the former has caused issues. We should completely replace `progressbar2` with `tqdm`

JRMeyer

enhancement

Feature request: Link all assets / repos to the docs

- model zoo - STT-examples - model-manager - open-speech-corpora

JRMeyer

enhancement

STT
STT copied to clipboard

Metadata

Feature request: Publish Android artifact to Maven

Re-Architect Training Pipeline

Feature request: Integrated Voice Activity Detection

Feature request: option to only allow sentences which occur in the language model

Feature request: .Net C# streaming VAD on Windows example project

Feature request: add text to docs about Opus, not just Wav

Feature request: Docs: add section on how to chose batch size

Bug: duplicate command-line training flags silently parsed

Feature request: replace progressbar2 with tqdm

Feature request: Link all assets / repos to the docs

← Metadata

Owner

Metadata

STT STT copied to clipboard

Metadata

← Metadata

Owner

Metadata

STT
STT copied to clipboard