audio
audio copied to clipboard
Data manipulation and transformation for audio signal processing, powered by PyTorch
### 🐛 Describe the bug ```python from io import BytesIO import torch import torchaudio torch.manual_seed(0) torchaudio.set_audio_backend("sox_io") sr = 16000 N = sr # in case you can't reproduce, try increasing...
This PR to keep track on the progress of implementing the silence function in sox (#260). Feel free to suggest any changes to the implementation :)
### 🐛 Describe the bug I built torchaudio from source following https://github.com/pytorch/audio/blob/main/CONTRIBUTING.md. The build was successful, but when I import torchaudio in python, I got the following error: ``` >>>...
Added workflow for building torchaudio wheels.
### 🚀 The feature To increase the speed of `InverseMelScale` module, the SGD optimization can be replace with ` torch.linalg.lstsq`. ### Motivation, pitch The current `InverseMelScale` module applies SGD optimizer...
### 🐛 Describe the bug It is written pretty clearly in the release notes that there is a breaking change when loading MP3 files: ``` MP3 decoding is now handled...
Addresses feedback from pytorch#632 Closes vincentqb/audio#2
For release 2.0, we plan to add support for multi-channel room impulse response simulation methods under `torchaudio.functional`. The implementation is based on [pyroomacoustics](https://github.com/LCAV/pyroomacoustics), that supports both "image source method", and...
### 🐛 Describe the bug for the function "torchaudio.compliance.kaldi.fbank", there is an option "dither". The function calls _get_window() function, where dither leads to adding random number in strided_input:  Since...