SpecAugment issues

spec_augment_pytorch has a bug

5

from SpecAugment.sparse_image_warp_zcaceres import sparse_image_warp ModuleNotFoundError: No module named 'SpecAugment

Run time gradually becomes slower if a large number of augmented spectrgrams are being generated using a for loop

When more than 1000 audio files are being fed to this tensorflow SpecAugment code using a python loop, the execution times gradually become very slow. Is there any way to...

spandandey21

ImportError: Cannot load backend 'TkAgg' which requires the 'tk' interactive framework, as 'headless' is currently running

5

Hi, First of all love the Library. I am having problems with importing spec_augment_pytorch. The issue is with importing matplotlib same as this post: [link](https://stackoverflow.com/questions/55811545/importerror-cannot-load-backend-tkagg-which-requires-the-tk-interactive-fra) But as its in the...

harrygcoppock

Fixed typo in import for pytorch

kalfasyan

from mel_spectrogram to wav again

26

Hi, Do you have any suggestion about how to re-build the audio file after augmentation?

kimchi88

Is it possible to use this algorithm to multiply audio files for a text-to-speech dataset ?

1

williamgun007

Question about normalization

1

Hello, I'm wondering how the log-mel spctrograms are normalized to have zero mean value. The paper mentioned about it, and the masking value is 0 because they have zero mean....

wade3han

IndexError: tuple index out of range on spec_augment_pytorch

9

Any idea on how to resolve this problem? ``` from SpecAugment import spec_augment_pytorch as spec_augment_l import librosa y, sr = librosa.load(dataset_audio_path.joinpath(audio_path), sr=16000) y, _ = librosa.effects.trim(y, top_db=15) S = librosa.feature.melspectrogram(y,...

iskorini

librosa.feature.melspectrogram is not equal as kaldi's mfcc extraction

2

the same audio, when I use librosa.feature.melspectrogram to extracte log mfcc, says Matrix A(43,N). But it turns out that Matrix A is not equal as kaldi's mfcc extraction, says Matrix...

KnowBetterHelps

Code maintenance

1

Hi, I was trying to use SpecAugment (just run a simple augmentation for now). I was able to install the package, but it seems to have many code issues like...

yardenkarny

SpecAugment
SpecAugment copied to clipboard

Metadata

spec_augment_pytorch has a bug

Run time gradually becomes slower if a large number of augmented spectrgrams are being generated using a for loop

ImportError: Cannot load backend 'TkAgg' which requires the 'tk' interactive framework, as 'headless' is currently running

Fixed typo in import for pytorch

from mel_spectrogram to wav again

Is it possible to use this algorithm to multiply audio files for a text-to-speech dataset ?

Question about normalization

IndexError: tuple index out of range on spec_augment_pytorch

librosa.feature.melspectrogram is not equal as kaldi's mfcc extraction

Code maintenance

← Metadata

Owner

Metadata

SpecAugment SpecAugment copied to clipboard

Metadata

← Metadata

Owner

Metadata

SpecAugment
SpecAugment copied to clipboard