audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
### 🐛 Describe the bug Originated from here. **Please do not hastily close this until torchaudio is built successfully. The official instructions do not have enough information to get the build...
### 🚀 The feature Suggest using `register_buffer()` with `persistent=False`, so that the buffer (e.g. the window of a spectrogram) is not included in the module's state dict. ### Motivation, pitch When I add...
### 🐛 Describe the bug I notice that in https://github.com/pytorch/audio/blob/main/src/torchaudio/functional/functional.py#L561-L568, ```python if norm is not None and norm != "slaney": raise ValueError('norm must be one of None or "slaney"') #...
### 🐛 Describe the bug Hi, I have found an abnormal situation when I try to use torchaudio to resample 8 kHz speech data to 16 kHz. The code I am using...
### 🚀 The feature I've written an `AudioBlockReader` that wraps StreamReader to return chunks of audio that are padded left and right with valid data. ### Motivation, pitch Let's say...
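The block-reading idea in this request can be sketched without any audio library. The snippet below is only an illustration under stated assumptions: `iter_blocks_with_context`, `block_size`, and `context` are hypothetical names, the input is a plain in-memory sequence rather than a `StreamReader`, and it is not the author's `AudioBlockReader`. Each block is extended to the left and right with neighbouring samples where they exist, so the padding is real data rather than zeros.

```python
from typing import Iterator, List, Sequence, Tuple


def iter_blocks_with_context(
    samples: Sequence[float], block_size: int, context: int
) -> Iterator[Tuple[int, List[float]]]:
    """Yield (start, block) pairs over `samples`.

    Each block covers `block_size` samples starting at `start`, extended by up
    to `context` neighbouring samples on each side. At the edges of the signal
    the extension is clipped, so no artificial padding is introduced.
    """
    if block_size <= 0 or context < 0:
        raise ValueError("block_size must be positive and context non-negative")
    for start in range(0, len(samples), block_size):
        lo = max(0, start - context)  # left context, clipped at the signal start
        hi = min(len(samples), start + block_size + context)  # right context, clipped at the end
        yield start, list(samples[lo:hi])


blocks = list(iter_blocks_with_context(list(range(10)), block_size=4, context=2))
```

A streaming version would have to buffer `context` samples from the previous and next chunks instead of slicing one sequence.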
### 🚀 The feature Convert power spectrogram into db and the reverse. ### Motivation, pitch I'm working on replacing my Librosa usage with torchaudio and this is a missing function...
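For reference, the conversion this request describes follows the usual decibel definition for power quantities, dB = 10 · log10(power / ref). The sketch below works on scalar values with the standard library only. The names `power_to_db` and `db_to_power` and the `amin` floor are assumptions made for illustration, not torchaudio functions.

```python
import math


def power_to_db(power: float, ref: float = 1.0, amin: float = 1e-10) -> float:
    """Convert a power value to decibels relative to `ref`.

    `amin` is a small floor (an assumption of this sketch) that avoids log10(0).
    """
    return 10.0 * math.log10(max(power, amin) / ref)


def db_to_power(db: float, ref: float = 1.0) -> float:
    """Invert power_to_db: convert decibels back to a power value."""
    return ref * 10.0 ** (db / 10.0)


db_value = power_to_db(100.0)
power_value = db_to_power(20.0)
```

The round trip is exact only for values at or above `amin`. Anything smaller is clamped before the logarithm is taken.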
### 🐛 Describe the bug
```python
import torch
from torchaudio.transforms import TimeMasking, FrequencyMasking

x = torch.randn(80, 100)
FrequencyMasking(10)(x)  # this works
TimeMasking(10)(x)  # this doesn't
```
Error message ``` File...
### 🚀 The feature [TimeStretch](https://pytorch.org/audio/stable/generated/torchaudio.transforms.TimeStretch.html) is not in SpecAugment. It should be Time Warping instead. Time Warping does not change the spectrogram shape, but "warps" the content. I want to...
Please support batch Kaldi fbank computation. The docs say "waveform (Tensor) – Tensor of audio of size (c, n) where c is in the range [0,2)"; right now only single utt compute is...