spchcat issues

Running on Standby Mode for File Input

I am looking to use speechcat as in on demand .wav file transcription. However, I require the model to be preloaded and waiting for intermittent transcription of input .wav files....

kwokyto

Not working on a Raspberry Pi

2

I am trying to get spchcat working on my raspberry pi. When running the command it is printing in the console this: ``` TensorFlow: v2.3.0-14-g4bdd3955115 Coqui STT: v1.1.0-0-gf3605e23 ``` and...

MiniMinnoww

Use feature test to expose `setenv`

As per the [man page](https://man7.org/linux/man-pages/man3/setenv.3.html), `setenv` requires `_POSIX_C_SOURCE` >= 200112L to be defined before including the appropriate header file (`stdlib.h`). As the other included header files include some standard headers...

msbit

Use float literals for `TEST_FLTEQ`

When using the `TEST_FLTEQ` macro, pass float literals for the comparision argument, to avoid errors of the form: error: absolute value function 'fabsf' given an argument of type 'double' but...

msbit

`fread_and_discard` vs `fseek`

It looks like `fread_and_discard` acts the same as [`fseek`](https://man7.org/linux/man-pages/man3/fseek.3.html). If so, this will replace calls to `fread_and_discard` with equivalent calls to `fseek` ✌️

msbit

Avoid possible infinite loop due to chunk ordering

Properly re-read the chunk ID when iterating through subsequent chunks. This avoids an infinite loop in the case where the `data` chunk doesn't immediately follow the `fmt ` chunk.

msbit

x86 Version Doesn't Default to English model

2

Running: `$ spchcat audio.wav` results in: ``` Warning: Model not found in /etc/spchcat/models/C/ Warning: Scorer not found in /etc/spchcat/models/C/ TensorFlow: v2.3.0-14-g4bdd3955115 Coqui STT: v1.1.0-0-gf3605e23 No model specified, cannot continue. Could...

dbreunig

Raspberry Pi Installation Prerequisites

8

People are going to have to have `sox` on their system to get this working: `$ sudo apt install sox`

dbreunig

Input sounds stream via stdin

Is it possible to supply the sound stream via stdin (or a pipe)? I need to make something like the following work in a shell script or similar: ``` arecord...

sanbee

Vosk

Have you looked into using Vosk as a backend? My tests with DeepSpeech, Coqui STT, and Vosk indicate that Vosk runs on older hardware and with higher accuracy.

TechnologyClassroom

spchcat
spchcat copied to clipboard

Metadata

Running on Standby Mode for File Input

Not working on a Raspberry Pi

Use feature test to expose `setenv`

Use float literals for `TEST_FLTEQ`

`fread_and_discard` vs `fseek`

Avoid possible infinite loop due to chunk ordering

x86 Version Doesn't Default to English model

Raspberry Pi Installation Prerequisites

Input sounds stream via stdin

Vosk

← Metadata

Owner

Metadata

spchcat spchcat copied to clipboard

Metadata

← Metadata

Owner

Metadata

spchcat
spchcat copied to clipboard