Data manipulation and transformation for audio signal processing, powered by PyTorch

Results 324 audio issues

### 🐛 Describe the bug Originated from here. **Please do not hastily close until torchaudio is built successfully. The official instructions do not have enough information to get the build...

### 🚀 The feature Suggest using `register_buffer()` with `persistent=False`, so that the buffer (e.g. the spectrogram window) is not included in the module's state dict. ### Motivation, pitch When I add...

help wanted
good first issue
triaged

### 🐛 Describe the bug I notice that in https://github.com/pytorch/audio/blob/main/src/torchaudio/functional/functional.py#L561-L568, ```python if norm is not None and norm != "slaney": raise ValueError('norm must be one of None or "slaney"') #...

### 🐛 Describe the bug Hi, I have found an abnormal situation when I try to use torchaudio to resample 8 kHz speech data to 16 kHz. The code I am using...

### 🚀 The feature I've written an `AudioBlockReader` that wraps StreamReader to return chunks of audio that are padded left and right with valid data. ### Motivation, pitch Let's say...
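The excerpt above is truncated, so the following is only one possible reading of "chunks padded left and right with valid data": a minimal pure-Python sketch over an in-memory list. The name `iter_padded_blocks` and the parameters `block_size` and `context` are hypothetical; this is not the `AudioBlockReader` from the issue and does not use `StreamReader`.

```python
def iter_padded_blocks(samples, block_size, context):
    """Yield (start, block) pairs.

    Each block covers a core region of up to `block_size` samples beginning
    at `start`, extended by up to `context` real neighbouring samples on each
    side (fewer at the edges of the signal, where no such data exists).
    """
    for start in range(0, len(samples), block_size):
        left = max(0, start - context)
        right = min(len(samples), start + block_size + context)
        yield start, samples[left:right]


# Hypothetical usage on a toy "signal" of ten samples.
blocks = list(iter_padded_blocks(list(range(10)), block_size=4, context=2))
for start, block in blocks:
    print(start, block)
```

Using real neighbouring samples as context, rather than zeros, is what lets a windowed operation near a block boundary see the same data it would see on the full signal.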

### 🚀 The feature Convert power spectrogram into db and the reverse. ### Motivation, pitch I'm working on replacing my Librosa usage with torchaudio and this is a missing function...
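For reference, the conversion requested above can be written out with the usual decibel definition for power quantities, dB = 10 · log10(P / ref), and its inverse P = ref · 10^(dB / 10). The sketch below uses only the Python standard library; the function names `power_to_db` and `db_to_power`, and the `ref` and `amin` parameters, are assumptions made for illustration and are not torchaudio or Librosa API.

```python
import math


def power_to_db(power, ref=1.0, amin=1e-10):
    """Convert a power value to decibels relative to `ref`.

    `amin` is an assumed floor that avoids taking log10 of zero.
    """
    return 10.0 * math.log10(max(power, amin) / ref)


def db_to_power(db, ref=1.0):
    """Invert power_to_db (exact only for values above the `amin` floor)."""
    return ref * 10.0 ** (db / 10.0)


# Toy 2x2 "power spectrogram" stored as nested lists.
spectrogram = [[1.0, 0.1], [100.0, 0.001]]
db = [[power_to_db(p) for p in row] for row in spectrogram]
restored = [[db_to_power(d) for d in row] for row in db]
```

The round trip recovers the input up to floating-point error, except for values clamped by `amin`, which come back as the floor value.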

### 🐛 Describe the bug

```python
import torch
from torchaudio.transforms import TimeMasking, FrequencyMasking

x = torch.randn(80, 100)
FrequencyMasking(10)(x)  # this works
TimeMasking(10)(x)  # this doesn't
```

Error message ``` File...

### 🚀 The feature [TimeStretch](https://pytorch.org/audio/stable/generated/torchaudio.transforms.TimeStretch.html) is not in SpecAugment. It should be Time Warping instead. Time Warping does not change the spectrogram shape, but "warps" the content. I want to...

Please support batch Kaldi fbank computation. "waveform (Tensor) – Tensor of audio of size (c, n) where c is in the range [0,2)" Right now only single-utterance computation is...

Kaldi