mimic3 icon indicating copy to clipboard operation
mimic3 copied to clipboard

[SSML] Line break and white space causing artefacts before </s> tag

Open Jiivee1 opened this issue 1 year ago • 1 comments

Describe the bug When Mimic 3 plays SSML file, line break and white space at the end of spoken text before closing tag causes artefacts and noise to speech. I first noticed this when I tested SSML sample file found on page https://mycroft-ai.gitbook.io/docs/mycroft-technologies/mimic-tts/mimic-3.

To Reproduce mimic3_ssml_cracks_and_noise.zip Steps to reproduce the behavior:

  1. Unzip attached file.
  2. Play SSML file interactively: mimic3 --ssml --interactive < mimic3_ssml_cracks_and_noise.ssml
  3. Alternatively play attached mimic3_ssml_cracks_and_noise.wav file.
  4. Note noise and artefacts and compare what your hear to how its is presented in SSML file.

Expected behavior There should not be any noise artefacts after spoken text in any of the cases.

Log files n/a

Environment (please complete the following information):

  • Desktop PC (AMD Ryzen Threadripper 2950X CPU), laptop (Intel i7-10510U CPU)
  • OS: Opensuse Tumbleweed
  • Mycrofr-core version
$ zypper info mycroft-core 
Information for package mycroft-core:
-------------------------------------
Repository     : Main Repository (OSS) (20230202)
Name           : mycroft-core
Version        : 18.8.13-1.19
Arch           : noarch
Vendor         : openSUSE
Installed Size : 14.5 MiB
Installed      : Yes
Status         : up-to-date
Source package : mycroft-core-18.8.13-1.19.src
Upstream URL   : https://mycroft.ai
Summary        : The Mycroft Artificial Intelligence platform
Description    : 
    Mycroft is a voice assistant.
  • Python3 is python 3.10.9

I have not touched config files at all in neither of the machines.

Jiivee1 avatar Feb 05 '23 12:02 Jiivee1