Retrieval-based-Voice-Conversion-WebUI icon indicating copy to clipboard operation
Retrieval-based-Voice-Conversion-WebUI copied to clipboard

[help wanted]ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

Open YukiWorks432 opened this issue 1 year ago • 12 comments

What do these errors represent, stderr is of no help. How can I solve this?

OS: Windows 10 CPU: AMD Ryzen7 3700X Memory: 32GB GPU: NVIDIA GeForce RTX 2060

Traceback (most recent call last):
  File "F:\RVC-beta-v2-0528\my_utils.py", line 14, in load_audio
    ffmpeg.input(file, threads=0)
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\ffmpeg\_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "F:\RVC-beta-v2-0528\infer-web.py", line 161, in vc_single
    audio = load_audio(input_audio_path, 16000)
  File "F:\RVC-beta-v2-0528\my_utils.py", line 19, in load_audio
    raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)

Traceback (most recent call last):
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict
    output = await app.blocks.process_api(
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\blocks.py", line 1007, in process_api
    data = self.postprocess_data(fn_index, result["prediction"], state)
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data
    prediction_value = block.postprocess(prediction_value)
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\components.py", line 2076, in postprocess
    processing_utils.audio_to_file(sample_rate, data, file.name)
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file
    data = convert_to_16_bit_wav(data)
  File "F:\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav
    if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'

YukiWorks432 avatar May 30 '23 05:05 YukiWorks432

i get this exact issue and my path to the audio to convert is C:\Users\lucky\Desktop\Audio so it doesnt seem to be a path issue? exact same error message

VioletChaolan avatar May 30 '23 17:05 VioletChaolan

after getting the stderr i get this in an admin terminal i literally cant give it more permission than it has. even in the security tab i gave full control and modify to the folder

`C:\Users\lucky\Desktop\RVC-beta-v2-0528>runtime\python.exe infer-web.py --pycmd runtime\python.exe --port 7897 Use Language: en_US Running on local URL: http://0.0.0.0:7897 loading weights/CSGOSAS_e10.pth gin_channels: 256 self.spk_embed_dim: 109 <All keys matched successfully> stdout: stderr: ffmpeg version n4.3.2-160-gfbb9368226 Copyright (c) 2000-2021 the FFmpeg developers built with gcc 9.3-win32 (GCC) 20200320 configuration: --prefix=/ffbuild/prefix --pkg-config-flags=--static --pkg-config=pkg-config --cross-prefix=x86_64-w64-mingw32- --arch=x86_64 --target-os=mingw32 --enable-version3 --disable-debug --disable-w32threads --enable-pthreads --enable-iconv --enable-zlib --enable-libxml2 --enable-libfreetype --enable-libfribidi --enable-gmp --enable-lzma --enable-fontconfig --enable-opencl --enable-libvmaf --disable-vulkan --enable-libvorbis --enable-amf --enable-libaom --disable-avisynth --enable-libdav1d --disable-libdavs2 --enable-ffnvcodec --enable-cuda-llvm --disable-libglslang --enable-libass --enable-libbluray --enable-libmp3lame --enable-libopus --enable-libtheora --enable-libvpx --enable-libwebp --disable-lv2 --enable-libmfx --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-librav1e --disable-librubberband --enable-schannel --enable-sdl2 --enable-libsoxr --enable-libsrt --enable-libtwolame --disable-libvidstab --disable-libx264 --disable-libx265 --disable-libxavs2 --disable-libxvid --enable-libzimg --extra-cflags=-DLIBTWOLAME_STATIC --extra-cxxflags= --extra-ldflags=-pthread --extra-libs=-lgomp libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 C:\Audio: Permission denied

Traceback (most recent call last): File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\infer-web.py", line 161, in vc_single audio = load_audio(input_audio_path, 16000) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\my_utils.py", line 22, in load_audio raise e File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\my_utils.py", line 14, in load_audio ffmpeg.input(file, threads=0) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\ffmpeg_run.py", line 325, in run raise Error('ffmpeg', out, err) ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

Traceback (most recent call last): File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict output = await app.blocks.process_api( File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\blocks.py", line 1007, in process_api data = self.postprocess_data(fn_index, result["prediction"], state) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data prediction_value = block.postprocess(prediction_value) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\components.py", line 2076, in postprocess processing_utils.audio_to_file(sample_rate, data, file.name) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file data = convert_to_16_bit_wav(data) File "C:\Users\lucky\Desktop\RVC-beta-v2-0528\runtime\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav if data.dtype in [np.float64, np.float32, np.float16]: AttributeError: 'NoneType' object has no attribute 'dtype'`

VioletChaolan avatar May 30 '23 19:05 VioletChaolan

alright gotta apoloigze it seems the actual issue is i gave it the file path without acctualy supplying the filename. i did this C:\Audio wheen it shouldve been C:\Audio\bucklemyshoe.wav

VioletChaolan avatar May 30 '23 20:05 VioletChaolan

alright gotta apoloigze it seems the actual issue is i gave it the file path without acctualy supplying the filename. i did this C:\Audio wheen it shouldve been C:\Audio\bucklemyshoe.wav

I tried but could not solve the problem. Is there anything else you have changed?

YukiWorks432 avatar May 31 '23 02:05 YukiWorks432

Start the webui with an admin console I guess?

VioletChaolan avatar May 31 '23 06:05 VioletChaolan

Start the webui with an admin console I guess?

It didn't solve my case...

YukiWorks432 avatar May 31 '23 08:05 YukiWorks432

I will add the details of the error.

This occurred when converting an audio file. In "Inferencing voice" I specified the pth file I downloaded and in "Enter the path of the audio file to be processed" I entered "F:\output.wav". Then I specified the wav file and clicked "Convert", and it took about 5 seconds before an error message appeared.

YukiWorks432 avatar Jun 03 '23 03:06 YukiWorks432

I had the same problem, but it worked fine for me before. I don't know what happened to cause the error

reflectt6 avatar Jun 04 '23 12:06 reflectt6

I printed the err and the key information is as follows:

"[lrc @ 000001cff9f144c0] Format lrc detected only with low score of 5, misdetection possible! Input #0, lrc, from 'F:\RVC\train\new/desktop.ini': Duration: N/A, bitrate: N/A Stream #0:0: Subtitle: text Codec AVOption threads (set the number of threads) specified for input file #0 (F:\RVC\train\new/desktop.ini) has not been used for any stream. The most likely reason is either wrong type (e.g. a video option with no video streams) or that it is a private option of some decoder which was not actually used for any stream. Output #0, f32le, to 'pipe:': Output file #0 does not contain any stream"

You can see that ffmpeg mistakenly treated audio files as lrc files. And I didn't notice that the audio source folder contains the "desktop. ini" file, which is a strange error.

Although I don't understand why there is this bug, creating a new folder and copying audio files to the new folder can avoid this problem.

reflectt6 avatar Jun 04 '23 13:06 reflectt6

same error, moving file sometimes fixes it and sometimes doesn't. it's strange.

Traceback (most recent call last):
  File "L:\RCV_voiceclone\my_utils.py", line 14, in load_audio
    ffmpeg.input(file, threads=0)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\ffmpeg\_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "L:\RCV_voiceclone\infer-web.py", line 161, in vc_single
    audio = load_audio(input_audio_path, 16000)
  File "L:\RCV_voiceclone\my_utils.py", line 19, in load_audio
    raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)

Traceback (most recent call last):
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict
    output = await app.blocks.process_api(
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\blocks.py", line 1007, in process_api
    data = self.postprocess_data(fn_index, result["prediction"], state)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data
    prediction_value = block.postprocess(prediction_value)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\components.py", line 2076, in postprocess
    processing_utils.audio_to_file(sample_rate, data, file.name)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file
    data = convert_to_16_bit_wav(data)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav
    if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'
Traceback (most recent call last):
  File "L:\RCV_voiceclone\my_utils.py", line 14, in load_audio
    ffmpeg.input(file, threads=0)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\ffmpeg\_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "L:\RCV_voiceclone\infer-web.py", line 161, in vc_single
    audio = load_audio(input_audio_path, 16000)
  File "L:\RCV_voiceclone\my_utils.py", line 19, in load_audio
    raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)

Traceback (most recent call last):
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict
    output = await app.blocks.process_api(
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\blocks.py", line 1007, in process_api
    data = self.postprocess_data(fn_index, result["prediction"], state)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data
    prediction_value = block.postprocess(prediction_value)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\components.py", line 2076, in postprocess
    processing_utils.audio_to_file(sample_rate, data, file.name)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file
    data = convert_to_16_bit_wav(data)
  File "L:\RCV_voiceclone\runtime\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav
    if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'

hobolyra avatar Jun 04 '23 20:06 hobolyra

I got a similar issue on the preprocess:

OS: Windows 11 CPU: AMD Ryzen7 5800X Memory: 64GB GPU: NVIDIA GeForce RTX 4090

start preprocess
['trainset_preprocess_pipeline_print.py', 'F:\\RVC-beta-v2-0618\\audio\\us', '40000', '16', 'F:\\RVC-beta-v2-0618/logs/us', 'False']
F:\RVC-beta-v2-0618\audio\us/test_us.wav->Traceback (most recent call last):
  File "F:\RVC-beta-v2-0618\my_utils.py", line 14, in load_audio
    ffmpeg.input(file, threads=0)
  File "F:\RVC-beta-v2-0618\runtime\lib\site-packages\ffmpeg\_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "F:\RVC-beta-v2-0618\trainset_preprocess_pipeline_print.py", line 79, in pipeline
    audio = load_audio(path, self.sr)
  File "F:\RVC-beta-v2-0618\my_utils.py", line 19, in load_audio
    raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)

ROBdk97 avatar Jun 28 '23 14:06 ROBdk97

I've trained my model with and without pitch guidance all in model v2 and v1. I Tried 0 protection 0.33 and 0.5, harvest crepe and pm give the same error. The path is good no spaces no () only - _

Anyone had luck in solving this?

Traceback (most recent call last):
  File "V:\AI\Retrieval-based-Voice-Conversion-WebUI\my_utils.py", line 14, in load_audio
    ffmpeg.input(file, threads=0)
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\ffmpeg\_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "V:\AI\Retrieval-based-Voice-Conversion-WebUI\infer-web.py", line 178, in vc_single
    audio = load_audio(input_audio_path, 16000)
  File "V:\AI\Retrieval-based-Voice-Conversion-WebUI\my_utils.py", line 19, in load_audio
    raise RuntimeError(f"Failed to load audio: {e}")
RuntimeError: Failed to load audio: ffmpeg error (see stderr output for detail)

Traceback (most recent call last):
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\routes.py", line 321, in run_predict
    output = await app.blocks.process_api(
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\blocks.py", line 1007, in process_api
    data = self.postprocess_data(fn_index, result["prediction"], state)
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data
    prediction_value = block.postprocess(prediction_value)
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\components.py", line 2076, in postprocess
    processing_utils.audio_to_file(sample_rate, data, file.name)
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file
    data = convert_to_16_bit_wav(data)
  File "C:\tools\miniconda3\envs\rvc\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav
    if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'

leaper-max avatar Jun 29 '23 08:06 leaper-max