Wav2Lip icon indicating copy to clipboard operation
Wav2Lip copied to clipboard

mel() error

Open EricKong1985 opened this issue 1 year ago • 5 comments

when I run the project in the colab,

i will show the error below:

Using cuda for inference. Reading video frames... Number of frames available for inference: 223 Traceback (most recent call last): File "/content/Wav2Lip/inference.py", line 280, in main() File "/content/Wav2Lip/inference.py", line 225, in main mel = audio.melspectrogram(wav) File "/content/Wav2Lip/audio.py", line 47, in melspectrogram S = _amp_to_db(_linear_to_mel(np.abs(D))) - hp.ref_level_db File "/content/Wav2Lip/audio.py", line 95, in _linear_to_mel _mel_basis = _build_mel_basis() File "/content/Wav2Lip/audio.py", line 100, in _build_mel_basis return librosa.filters.mel(hp.sample_rate, hp.n_fft, n_mels=hp.num_mels, TypeError: mel() takes 0 positional arguments but 2 positional arguments (and 3 keyword-only arguments) were given

EricKong1985 avatar Aug 30 '23 16:08 EricKong1985

Don't know if it will helpful for you but I have created a repository that fixes the running for inference using the pretrained model. here is the link https://github.com/HassanMuhammadSannaullah/Wav2lip-Fix-For-Inference

pip install librosa == 0.8.0

xugaoxiang avatar Sep 07 '23 11:09 xugaoxiang

when using librosa>=0.10.0, you should fix “_build_mel_basis()” in audio.py


def _build_mel_basis():
    assert hp.fmax <= hp.sample_rate // 2
    return librosa.filters.mel(sr=hp.sample_rate, n_fft=hp.n_fft, n_mels=hp.num_mels,
                               fmin=hp.fmin, fmax=hp.fmax)

er1cw00 avatar Sep 11 '23 07:09 er1cw00

when using librosa>=0.10.0, you should fix “_build_mel_basis()” in audio.py

def _build_mel_basis():
    assert hp.fmax <= hp.sample_rate // 2
    return librosa.filters.mel(sr=hp.sample_rate, n_fft=hp.n_fft, n_mels=hp.num_mels,
                               fmin=hp.fmin, fmax=hp.fmax)

Thanks for sharing, this change works for me

vimgaa avatar Dec 01 '23 03:12 vimgaa

when using librosa>=0.10.0, you should fix “_build_mel_basis()” in audio.py

def _build_mel_basis():
    assert hp.fmax <= hp.sample_rate // 2
    return librosa.filters.mel(sr=hp.sample_rate, n_fft=hp.n_fft, n_mels=hp.num_mels,
                               fmin=hp.fmin, fmax=hp.fmax)

You're a lifesaver! Thank you very much :)

Aml-Hassan-Abd-El-hamid avatar Jan 20 '24 00:01 Aml-Hassan-Abd-El-hamid