bark icon indicating copy to clipboard operation
bark copied to clipboard

new issue

Open tomas122727 opened this issue 2 years ago • 9 comments

this is when im generating the audio ? segment_text: hello i am ai voice --Segment 1/1: est. 2.00s (1 of 1 iterations) hello i am ai voice Traceback (most recent call last): File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\gradio\routes.py", line 422, in run_predict output = await app.get_blocks().process_api( File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\gradio\blocks.py", line 1323, in process_api result = await self.call_function( File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\gradio\blocks.py", line 1051, in call_function prediction = await anyio.to_thread.run_sync( File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\anyio\to_thread.py", line 33, in run_sync return await get_asynclib().run_sync_in_worker_thread( File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\anyio_backends_asyncio.py", line 877, in run_sync_in_worker_thread return await future File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\anyio_backends_asyncio.py", line 807, in run result = context.run(func, *args) File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\gradio\helpers.py", line 588, in tracked_fn response = fn(*args) File "C:\Users\tomas_000\Desktop\bark-main\bark_webui.py", line 432, in generate_audio_long_gradio full_generation_segments, audio_arr_segments, final_filename_will_be = api.generate_audio_long_from_gradio(**kwargs) File "C:\Users\tomas_000\Desktop\bark-main\bark_infinity\api.py", line 829, in generate_audio_long_from_gradio full_generation_segments, audio_arr_segments, final_filename_will_be = generate_audio_long(**kwargs) File "C:\Users\tomas_000\Desktop\bark-main\bark_infinity\api.py", line 1123, in generate_audio_long write_one_segment(audio_arr = np.concatenate(audio_arr_segments), full_generation = full_generation_segments[0], **kwargs) File "C:\Users\tomas_000\Desktop\bark-main\bark_infinity\api.py", line 360, in write_one_segment write_seg_wav(filepath, audio_arr, **kwargs) File "C:\Users\tomas_000\Desktop\bark-main\bark_infinity\api.py", line 409, in write_seg_wav write_audiofile(filepath, audio_arr, **kwargs) File "C:\Users\tomas_000\Desktop\bark-main\bark_infinity\api.py", line 436, in write_audiofile audio = AudioSegment.from_wav(temp_wav) File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\pydub\audio_segment.py", line 808, in from_wav return cls.from_file(file, 'wav', parameters=parameters) File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\pydub\audio_segment.py", line 728, in from_file info = mediainfo_json(orig_file, read_ahead_limit=read_ahead_limit) File "C:\Users\tomas_000\AppData\Roaming\Python\Python310\site-packages\pydub\utils.py", line 274, in mediainfo_json res = Popen(command, stdin=stdin_parameter, stdout=PIPE, stderr=PIPE) File "C:\Program Files\Python310\lib\subprocess.py", line 971, in init self._execute_child(args, executable, preexec_fn, close_fds, File "C:\Program Files\Python310\lib\subprocess.py", line 1440, in _execute_child hp, ht, pid, tid = _winapi.CreateProcess(executable, args, FileNotFoundError: [WinError 2] The system cannot find the file specified

tomas122727 avatar Jun 05 '23 11:06 tomas122727

This means you need some audio for proper file formats. In fact the problem might be I switched the default from mp4 to mp3.

If you set it to wav, it will work for now. Or try mp4 instead of mp3. (mp3 is default)

Try

pip install av

or

conda install -c conda-forge ffmpeg
conda install -c conda-forge ffmpeg-python

JonathanFly avatar Jun 05 '23 12:06 JonathanFly

no this is when im using the regular tts ?

tomas122727 avatar Jun 05 '23 12:06 tomas122727

i choose the speaker and i just typed up text i am a bot just testing this and im trying to just generate audio from my text and it comes up with this error also is my bark infinity folder supposed to be 32 mb in size ?

tomas122727 avatar Jun 05 '23 12:06 tomas122727

The wav files are pretty big.

First try update:

git pull
mamba env update -f environment-cuda.yml
python -m pip install --upgrade pip
pip install --upgrade setuptools 
pip install -r requirements_conda_missing.txt

Then if that doesn't help try wav or mp4

--output_format wav

or

--output_format mp4

Capture




JonathanFly avatar Jun 05 '23 13:06 JonathanFly

You can also try install regular windows ffmpeg. Try to install it so if you type:

ffmpeg -version

it works where you run bark.

I will probably be in Bark Discord if you need help, end of Monday or Tuesday AM.

JonathanFly avatar Jun 05 '23 13:06 JonathanFly

I think I have a fix that works in most systems, more reliably.

pip install ffmpeg-downloader
ffdl install --add-path 

That install ffmpeg via python library designed for it, and seems to do it in a way where torch and python can more reliably find and use it. After you run those commands you have to exit the miniconda prompt (or the terminal if you aren't using conda or mamba at all) and restart.

Then type

ffmpeg -version
ffprobe -version

to make sure they work.

I'll to a quick update today and maybe a bigger install update...

JonathanFly avatar Jun 06 '23 14:06 JonathanFly

Can I have the link of discord group? I'm kinda overwhelmed with all of this button. 😅

Zyphyrs avatar Jun 07 '23 13:06 Zyphyrs

It's the Bark discord: https://github.com/suno-ai/bark link on the top of the readme.

JonathanFly avatar Jun 07 '23 13:06 JonathanFly

Thank you!🤝

Zyphyrs avatar Jun 07 '23 13:06 Zyphyrs