merlin icon indicating copy to clipboard operation
merlin copied to clipboard

acoustic_comp: the frame number of data stream lf0 is not consistent with others

Open PavanIISc opened this issue 8 years ago • 9 comments

I am getting this error. assert out_frame_number == frame_number AssertionError

(I am trying this system for Indian language) screenshot from 2017-02-17 12 54 09

Why am I getting this error. Can you please tell me what should I to change?

PavanIISc avatar Feb 17 '17 11:02 PavanIISc

As the error in your output says: acoustic_comp: the frame number of data stream lf0 is not consistent with others: current 2783 others 1393

So you have a mistmatched number of frames on the file.

RasmusD avatar Feb 17 '17 12:02 RasmusD

Did you use the WORLD vocoder to extract the lf0 values? How did you create the label_state_align files? The number of frames in each have to match. In WORLD lf0 values are computed every 5ms.

dreamk73 avatar Feb 17 '17 15:02 dreamk73

Make sure the number of frames in each of lf0, bap and mgc are same. Use "x2x" in SPTK to find out the number of frames.

./tools/bin/SPTK-3.9/x2x +fa lf0/cmu_us_arctic_slt_text_01001.lf0 | wc -l ./tools/bin/SPTK-3.9/x2x +fa bap/cmu_us_arctic_slt_text_01001.bap | wc -l (assuming 16khz) ./tools/bin/SPTK-3.9/x2x +fa mgc/cmu_us_arctic_slt_text_01001.mgc | xargs -n60 | wc -l

ronanki avatar Feb 18 '17 07:02 ronanki

@dreamk73 I have used WORLD vocoder "extract_features_for_merlin.sh" script to extract lf0, bap and mgc.

PavanIISc avatar Feb 20 '17 04:02 PavanIISc

@ronanki I have checked the number of frames. They are mismatching. How to extract lf0, bap and mgc? which script should I use?

PavanIISc avatar Feb 20 '17 04:02 PavanIISc

This is the script you should use: https://github.com/CSTR-Edinburgh/merlin/blob/master/misc/scripts/vocoder/world/extract_features_for_merlin.sh

Also set sampling frequency to either 16000Hz or 48000Hz w.r.t the data you are using, as the default value is 16000Hz: https://github.com/CSTR-Edinburgh/merlin/blob/master/misc/scripts/vocoder/world/extract_features_for_merlin.sh#L31

I hope this solves your problem.

ronanki avatar Feb 20 '17 06:02 ronanki

Also the wav files must be of single channel i.e. mono waves

abhigarg avatar Sep 20 '17 18:09 abhigarg

@ronanki Hello there. I've already converted my files to mono waves and sample rate of 16kHz.

I've used this script as I'm using the world_v2 vocoder: merlin/misc/scripts/vocoder/world_v2/copy_synthesis.sh

But the frames are still different. The lf0 and mgc has no problem. Only the bap frame is not the same. Can I know how to correct this, please?

nasmsh avatar Feb 07 '18 02:02 nasmsh

@nasmsh I am also facing the same problem of mismatched frames, despite using the correct scripts. How did you solve this?

anushaprakash90 avatar Oct 17 '18 05:10 anushaprakash90