STT icon indicating copy to clipboard operation
STT copied to clipboard

Bug: it writes text without dot and comma

Open davidak opened this issue 3 years ago • 3 comments

Describe the bug it writes text without dot and comma

To Reproduce Steps to reproduce the behavior:

tts --text "To help with the large amounts of pull requests, we would appreciate your reviews of other pull requests, especially simple package updates. Just leave a comment describing what you have tested in the relevant package/service. Reviewing helps to reduce the average time-to-merge for everyone. Thanks a lot if you do!"

[nix-shell:~]$ stt --model coqui-stt-0.9.3-models.pbmm --scorer coqui-stt-0.9.3-models.scorer --audio '/home/davidak/Downloads/tts-0.0.14.wav' 
TensorFlow: v2.3.0-6-g23ad988fcde
 Coqui STT: v0.10.0-alpha.4-74-g49cdf7a6
2021-05-21 23:11:10.956472: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN)to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
rate: rate clipped 1 samples; decrease volume?
to help with the large amounts of pull requests we would appreciate your reviews of other poll requests especially simple package up dates just leave a comment describing what you have tested in the relevant packages service reviewing helps to reduce the average time to merge for every one thanks a lot if you do

Expected behavior In the best case, it would be exactly the same as the input here.

I also tried with spoken words from me and it ignored pauses and intonation.

Environment (please complete the following information):

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): NixOS 21.05pre289526.7a1fbc38a4b
  • TensorFlow installed from (our builds, or upstream TensorFlow):
  • TensorFlow version (use command below): v2.3.0-6-g23ad988fcde
  • Python version:
  • Bazel version (if compiling from source):
  • GCC/Compiler version (if compiling from source):
  • CUDA/cuDNN version:
  • GPU model and memory:
  • Exact command to reproduce:

davidak avatar May 21 '21 21:05 davidak

That's not a bug, we normalize the training data to remove punctuation. It may be a feature request :)

reuben avatar May 22 '21 07:05 reuben

@davidak It's possible to train a model in which the training data is normalized to include punctuation. Such a model will include punctuation in its transcription.

So if you feel up to it you could train your own model to transcribe with punctuation!

kdavis-coqui avatar May 22 '21 09:05 kdavis-coqui

in that case take it as a feature request

Am 22. Mai 2021 09:07:54 MESZ schrieb Reuben Morais @.***>:

That's not a bug, we normalize the training data to remove punctuation. It may be a feature request :)

-- You are receiving this because you authored the thread. Reply to this email directly or view it on GitHub: https://github.com/coqui-ai/STT/issues/1873#issuecomment-846366343

-- Diese Nachricht wurde von meinem Android-Gerät mit K-9 Mail gesendet.

davidak avatar May 25 '21 13:05 davidak