MemoryError
I'm on Windows 10 in a Jupyter environment. The audio file lasted 30 minutes, so I cut it into 10-second chunks, but on the very first chunk (chunk0) I ran into a MemoryError.
Is the file still too large for this? I followed this notebook: https://colab.research.google.com/github/timsainb/noisereduce/blob/master/notebooks/1.0-test-noise-reduction.ipynb#scrollTo=E5UkLtmT3xy3, where the sample only lasts four seconds.
Or would parameter tuning help here?
from pydub import AudioSegment
from pydub.utils import make_chunks
import soundfile as sf
import noisereduce as nr

myaudio = AudioSegment.from_file('myaudio.wav', "wav")
chunk_length_ms = 10000  # pydub calculates in milliseconds
chunks = make_chunks(myaudio, chunk_length_ms)  # make 10-second chunks

# Export all of the individual chunks as wav files
for i, chunk in enumerate(chunks):
    chunk_name = "chunk{0}.wav".format(i)
    print("exporting", chunk_name)
    chunk.export(chunk_name, format="wav")

data, rate = sf.read('chunk0.wav')
reduced_noise = nr.reduce_noise(y=data, sr=rate, n_std_thresh_stationary=1.5, stationary=True)
MemoryError: Unable to allocate 197. GiB for an array with shape (441000, 60002) and data type float64
I am getting the same error with an 800 KB file that lasts 5 seconds. This is the error I get:
MemoryError: Unable to allocate array with shape (199898, 60002) and data type float64
What is the shape of the input data?
The shape of my input data is (199898, 2) and the sample rate is 44100.
There should be no problem with data size; I've run the algorithm on 30-minute-plus files (there are params to do the chunking for you).
Can you try transposing the array? It might be that the input is channels x samples rather than samples x channels. If so, I should make a PR to add a warning when there are more channels than samples.
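A quick way to apply that suggestion is to check the array's orientation before calling reduce_noise. This is a minimal sketch (not part of the noisereduce API), using a zero array in place of the real recording; it assumes the soundfile-style (samples, channels) layout described above:

```python
import numpy as np

# Stand-in for sf.read output: (samples, channels) layout
data = np.zeros((441000, 2))

# If there are far more "channels" than samples, the layout is
# almost certainly flipped relative to (channels, samples)
if data.shape[0] > data.shape[1]:
    data = data.T

print(data.shape)  # (2, 441000)
```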
@timsainb, thanks for the reply.
> What is the shape of the input data?
data[:3]
array([[ 0.00000000e+00, 0.00000000e+00], [ 0.00000000e+00, 0.00000000e+00], [-3.05175781e-05, 0.00000000e+00]])
data.shape
(441000, 2)
rate
44100
> There should be no problem with data size, I've run the algorithm on 30 mins+ files (there are params to do chunking for you).
> Can you try transposing the array?
reduced_noise = nr.reduce_noise(y=data.T, sr=rate, n_std_thresh_stationary=1, stationary=True)
IPython.display.Audio(data=reduced_noise, rate=rate)
After transposing the data with data.T as above, the error is fixed; however, the result sounds messy and even noisier.
@joingreat Your audio is not mono. Please try again after converting your audio to a single (mono) channel.
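One common way to do that mono conversion is to average the two channels; the thread doesn't specify a method, so this is just a sketch, with random data standing in for the wav file read via soundfile's (samples, channels) layout:

```python
import numpy as np

# Stand-in for sf.read output: 1 second of stereo at 44100 Hz
data = np.random.randn(44100, 2)

if data.ndim == 2:
    # Average left and right into a single mono track
    data = data.mean(axis=1)

print(data.shape)  # (44100,)
```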
The package should be able to handle long and multi-channel data fine. Examples:
Long: https://colab.research.google.com/github/timsainb/noisereduce/blob/master/notebooks/1.0-test-noise-reduction.ipynb#scrollTo=a2stIgrUlX2h
Multichannel: https://colab.research.google.com/github/timsainb/noisereduce/blob/master/notebooks/1.0-test-noise-reduction.ipynb#scrollTo=9G0YvxDplX2k
Can either of you try to reproduce your error in a Colab notebook so I can take a look?
I had the same issue and I fixed it by using .T.
I found it in these two Stack Overflow posts:
https://stackoverflow.com/questions/57137050/error-passing-wav-file-to-ipython-display/57137391
https://stackoverflow.com/questions/40822877/scipy-io-cant-write-wavfile
import IPython
from scipy.io import wavfile
import noisereduce as nr
from pydub import AudioSegment
from tinytag import TinyTag
import soundfile as sf
from noisereduce.generate_noise import band_limited_noise
import matplotlib.pyplot as plt
import urllib.request
import numpy as np
import io
%matplotlib inline
data, rate = sf.read('chunk0.wav')
data.shape

reduced_noise = nr.reduce_noise(y=data.T, sr=rate, n_std_thresh_stationary=1, stationary=True)
IPython.display.Audio(data=reduced_noise, rate=rate)
The chunk0.wav was uploaded above as chunk0.zip. reduced_noise is fine after the .T transpose, though the result sounds a little harsh, especially at the beginning.
Maybe the splitting step hurt the file?
I am not sure that it's related, but I had that problem too. After transposing the data, reduce_noise() seemed to work well, but I then got this error in wavfile.write(): struct.error: ushort format requires 0 <= number <= (0x7fff * 2 + 1)
The code (example.wav is inside zipped.zip): zipped.zip
from scipy.io import wavfile
import noisereduce as nr
# load data
rate, data = wavfile.read("example.wav")
# perform noise reduction
reduced_noise = nr.reduce_noise(y=data.T, sr=rate)
wavfile.write("reduced.wav", rate, reduced_noise)
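One plausible cause of that struct.error (an assumption on my part, not confirmed in the thread): reduce_noise(y=data.T, ...) returns its result in (channels, samples) layout, while wavfile.write expects (Nsamples, Nchannels) and stores the channel count in an unsigned 16-bit header field, so a long second axis overflows it. A sketch of the fix is to transpose back before writing, with a zero array standing in for the real output:

```python
import numpy as np
from scipy.io import wavfile

rate = 44100
# Stand-in for nr.reduce_noise(y=data.T, sr=rate) output:
# (channels, samples) layout, 5 seconds of stereo
reduced_noise = np.zeros((2, rate * 5), dtype=np.float64)

# Transpose back to (samples, channels) so the header's 16-bit
# channel-count field receives 2, not the sample count
wavfile.write("reduced.wav", rate, reduced_noise.T.astype(np.float32))
```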
Update: my wav file was stereo. I noticed that everything works fine with mono, so I separated the data into two mono channels, ran the algorithm on each, then stitched them back together. Worked perfectly.
import numpy as np
from scipy.io import wavfile
import noisereduce as nr

# load data
rate, data = wavfile.read("example.wav")
data1 = data[:, 0]
data2 = data[:, 1]
# perform noise reduction on each channel separately
reduced_noise1 = nr.reduce_noise(y=data1, sr=rate)
reduced_noise2 = nr.reduce_noise(y=data2, sr=rate)
# stitch the channels back together
reduced_noise = np.stack((reduced_noise1, reduced_noise2), axis=1)
wavfile.write("reduced.wav", rate, reduced_noise)
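The same per-channel approach generalizes to any number of channels with a loop. This is a sketch of that idea, with an identity function and random data standing in for nr.reduce_noise and the wav file:

```python
import numpy as np

def reduce_per_channel(data, reduce_fn):
    """Run a mono noise-reduction function on each column of a
    (samples, channels) array and restack the results."""
    reduced = [reduce_fn(data[:, c]) for c in range(data.shape[1])]
    return np.stack(reduced, axis=1)

# Demo with an identity "reducer" in place of nr.reduce_noise
data = np.random.randn(1000, 2)
out = reduce_per_channel(data, lambda ch: ch)
print(out.shape)  # (1000, 2)
```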
@hananell why does it work with mono? and does it affect that the audio is being split into 2 then merged back?
It seems stereo input needs the shape (n_channels, n_frames), which is different from the output of soundfile, so you need to transpose it first.
Had the same issue; converting the input wav to mono fixed it. EDIT: found a one-liner to just take the left channel:
rate, data = wavfile.read(fname)
data = data[:, 0]
data.shape
(1757089, 2)  # ~40 seconds, stereo, 44100 Hz
Traceback (most recent call last):
File "C:\Users\MSI\Desktop\loo_test\fix.py", line 7, in <module>
reduced_noise = nr.reduce_noise(y=data, sr=rate)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\MSI\AppData\Local\Programs\Python\Python311\Lib\site-packages\noisereduce\noisereduce.py", line 597, in reduce_noise
return sg.get_traces()
^^^^^^^^^^^^^^^
File "C:\Users\MSI\AppData\Local\Programs\Python\Python311\Lib\site-packages\noisereduce\noisereduce.py", line 235, in get_traces
filtered_chunk = self.filter_chunk(start_frame=0, end_frame=end_frame)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\MSI\AppData\Local\Programs\Python\Python311\Lib\site-packages\noisereduce\noisereduce.py", line 165, in filter_chunk
padded_chunk = self._read_chunk(i1, i2)
^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\MSI\AppData\Local\Programs\Python\Python311\Lib\site-packages\noisereduce\noisereduce.py", line 157, in _read_chunk
chunk = np.zeros((self.n_channels, i2 - i1))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
numpy.core._exceptions._ArrayMemoryError: Unable to allocate 786. GiB for an array with shape (1757089, 60002) and data type float64
Cool project by the way! I used an AI-powered online tool and it worked significantly better, but it introduced weird hallucinations (and they want to charge $500 to process all the audio xD).