Haohe Liu / 刘濠赫 comments

Results 107 comments of


                                            Haohe Liu / 刘濠赫

How to get these files ? such as :f_2_64.mat/h_4_64.mat thanks

You can use this Matlab toolbox: https://www.mathworks.com/matlabcentral/fileexchange/40128-filter-bank-design.

Trainig code?

The training code will be available in this repo https://github.com/haoheliu/AudioLDM-training-finetuning. Will work on it when I got time.

Runtime error: espeak not installed on your system

Oh yes I forgot this dependency. You can do the following ``` sudo apt-get install espeak ```

how to use implement text2speech？

The checkpoint for Text-to-Speech will be released later.

Training code

@BestVicky Please checkout this repo https://github.com/haoheliu/AudioLDM-training-finetuning. Thanks

Style Transfer

AudioLDM 2 is also capable of style transfer. But some extra coding is needed to achieve this. You can refer to the style transfer code in AudioLDM v1 for more...

Inference code

The project is in active building. I'll add that in later.

Inference code

The inference code is ready now. Please checkout the main branch

Use pretrained model on 32kHz dataset to fintune on 16kHz dataset

Yes I think you can use your higher sampling rate audio (>16kHz) to finetune our 16kHz model. That sounds good to me.

Training on our own dataset.

It's super easy to use your own dataset. I'll update the readme with more tutorials later.