STT icon indicating copy to clipboard operation
STT copied to clipboard

Bug: web site stt.readthedocs.io doesn't have working info on getting started with training

Open FrontierDK opened this issue 2 years ago • 1 comments

Hi there.

I have just tried a lot of the methods shown on stt.readthedocs.io on how to get started with training...and none of them worked. Will the web site be updated?

Background To make sure I didn't have old stuff, I used a fresh virtual Ubuntu 21.10 desktop, with unddo/snapshots of the VM, so that I could try multiple ways/methods, without having old stuff from previous attempts left over...

https://stt.readthedocs.io/en/latest/TRAINING_INTRO.html

  • The build docker from source using Dockerfile.train doesn't work. Throws tons of errors during creation, and can't start.

https://stt.readthedocs.io/en/latest/DEPLOYMENT.html

  • Doesn't say how to install prerequisites, when done and running coqui-stt-model-manager install, it throws this error: legacy-install-failure. Encountered error while trying to install package webrtcvad

https://www.tensorflow.org/install/source#configure_the_build

  • The command pip install -U --user pip numpy wheel packaging fails with "Requirement already satisfied: pip in /usr/lib/python3/dist-packages (20.3.4)". Remember, I'm using a fresh/newly installed VM/PC. The Ubuntu server does get through this, but then we get to the "./configure" command, which tells you to install Baxel. Not told anywhere on the page.

Preparing and training data takes a LOT of time, A lot more trained datasets would be available if the documetation is updated to a level, where you can follow the command guides from a newly installed PC/VM, use just the listed commands...and end out with a working SR PC/VM. A lot of training data might even be shared, so that our models are more versatile. Is there a chance that the documentation is updated?

FrontierDK avatar Mar 19 '22 11:03 FrontierDK

@FrontierDK is this the error you get with building the dockerfile? https://github.com/coqui-ai/STT/issues/2168

FWIW -- the best way to start training is to pull down the pre-built docker images:

$ docker pull ghcr.io/coqui-ai/stt-train

this is actually the recommended training setup, as noted in the docs... did this not work for you for some reason?

JRMeyer avatar Mar 23 '22 03:03 JRMeyer