crank
A toolkit for non-parallel voice conversion based on a vector-quantized variational autoencoder
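The core of a VQ-VAE is its vector-quantization bottleneck. As an illustrative NumPy sketch (generic, not crank's implementation; the codebook size and embedding dimension are made up), each encoder output frame is snapped to its nearest codebook entry, and the resulting discrete indices are what the decoder conditions on:

```python
import numpy as np

rng = np.random.default_rng(0)
codebook = rng.normal(size=(64, 16))   # 64 codes, 16-dim embeddings (illustrative)
z_e = rng.normal(size=(100, 16))       # 100 encoder output frames

# Squared Euclidean distance from every frame to every codebook vector
d = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
indices = d.argmin(axis=1)             # one discrete code per frame
z_q = codebook[indices]                # quantized latents passed to the decoder
```

Because `indices` is a small discrete sequence, it carries far less speaker-specific detail than `z_e`, which is what makes non-parallel conversion with a speaker-conditioned decoder possible.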
Is there any detailed information on all the parameters in the config files and how they affect the audio? ``` conf/mlfb_vqvae.yml conf/mlfb_vqvae.yml ``` I left it all on default and...
I trained the model and now want to test it by converting new audio files with the newly trained model. How should I approach this?
I can't start the training at stage 3 anymore: ``` # python -m crank.bin.train --flag train --n_jobs 10 --conf conf/mlfb_vqvae.yml --checkpoint None --scpdir data/scp --featdir data/feature --expdir exp # Started...
`--voc GL` did not work because griffin_lim.py could not find the *.h5 files for evaluation. A quick hack to make Griffin-Lim work.
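For background on the `GL` vocoder option: Griffin-Lim estimates a phase for a magnitude spectrogram by alternating between the magnitude constraint and STFT consistency. A minimal SciPy sketch (illustrative only, not crank's griffin_lim.py; the FFT size, hop length, and iteration count are assumptions):

```python
import numpy as np
from scipy.signal import stft, istft

def griffin_lim(mag, n_fft=512, hop=128, n_iter=32, seed=0):
    """Recover a waveform from a magnitude spectrogram by iterating:
    inverse-STFT with the current phase, re-STFT, keep only the phase."""
    nover = n_fft - hop
    rng = np.random.default_rng(seed)
    angles = np.exp(2j * np.pi * rng.random(mag.shape))  # random initial phase
    for _ in range(n_iter):
        _, y = istft(mag * angles, nperseg=n_fft, noverlap=nover)
        _, _, spec = stft(y, nperseg=n_fft, noverlap=nover)
        angles = np.exp(1j * np.angle(spec[:, : mag.shape[1]]))
    _, y = istft(mag * angles, nperseg=n_fft, noverlap=nover)
    return y

# Usage: round-trip a test tone through |STFT| -> Griffin-Lim.
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 440 * t)
_, _, S = stft(x, nperseg=512, noverlap=384)
x_hat = griffin_lim(np.abs(S), n_fft=512, hop=128, n_iter=32)
```

The quality is noticeably below a neural vocoder such as PWG, which is why GL is usually only a quick sanity check.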
I have a bunch of real voice files, some of them with really bad quality. At stage 2, in utils.py, convert_continuos_f0(...), the line start_f0 = f0[f0 != 0][0] raises the exception "index...
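For context, the reported crash is consistent with an all-unvoiced file: if every F0 frame is zero, `f0[f0 != 0]` is empty and indexing `[0]` raises `IndexError`. A hypothetical guarded variant (a sketch of the general technique, not crank's actual fix) that interpolates over unvoiced frames and falls back gracefully when no voiced frame exists:

```python
import numpy as np

def convert_continuous_f0_safe(f0):
    """Linearly interpolate F0 over unvoiced (zero) frames.
    Returns the input unchanged when the whole track is unvoiced."""
    voiced = np.flatnonzero(f0 != 0)
    if voiced.size == 0:
        # All-unvoiced file: nothing to anchor the interpolation on,
        # so skip instead of crashing on f0[f0 != 0][0].
        return f0.astype(float).copy()
    cf0 = f0.astype(float).copy()
    cf0[: voiced[0]] = f0[voiced[0]]        # extend start_f0 backwards
    cf0[voiced[-1] + 1 :] = f0[voiced[-1]]  # extend end_f0 forwards
    zeros = np.flatnonzero(cf0 == 0)        # remaining interior gaps
    cf0[zeros] = np.interp(zeros, voiced, f0[voiced])
    return cf0
```

With a guard like this, degraded recordings where the F0 extractor finds no voiced frames can be skipped or logged instead of aborting the feature-extraction stage.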
I added a debug print(...) somewhere, for example in feature.py's _open_wavf(...), to print the wav file name. Sometimes I see the output, sometimes not. Obviously this is a multithreading-related error. P.S....
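One common cause of intermittent debug output during parallel feature extraction is stdio buffering in worker processes: a child's buffered prints can be lost or interleaved. A small sketch (hypothetical file names and worker function, not crank's code) showing `flush=True` to make per-worker prints reliable:

```python
import multiprocessing as mp
import os

def process_wav(path):
    # flush=True pushes the line out of the child's stdio buffer
    # immediately; without it, buffered output can vanish if a worker
    # exits early or interleave unpredictably with other processes.
    print(f"[pid {os.getpid()}] extracting features from {path}", flush=True)
    return path

if __name__ == "__main__":
    with mp.Pool(processes=4) as pool:
        done = pool.map(process_wav, ["a.wav", "b.wav", "c.wav"])
    print(f"processed {len(done)} files", flush=True)
```

Using the `logging` module with a per-process handler is a more robust alternative when debugging jobs launched with a high `--n_jobs`.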
Since the VCC data is not commonly available, could you either:

- release your pretrained models, or
- add a VCTK / LibriSpeech recipe? (since that data is freely available)
```
Traceback (most recent call last):
  File "/opt/conda/lib/python3.7/runpy.py", line 193, in _run_module_as_main
    "__main__", mod_spec)
  File "/opt/conda/lib/python3.7/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/data/crank/crank/bin/extract_feature.py", line 18, in <module>
    from crank.feature import...
```
- [x] vcc2020
  - [x] PWG
  - [ ] MCD
  - [ ] MOSNet
- [x] vcc2018
  - [x] PWG
  - [x] MCD
  - [x] MOSNet
Feature
- [x] Add GL and neural vocoder samples for vcc2020v1 and vcc2018 recipes.
- [x] Implement objective evaluation stage
- [ ] Modify CI

Anything else?