audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
### 🐛 Describe the bug Hi there, I am trying to build torchaudio v0.9.0 with pytorch 1.9 from source (looks like [this post](https://github.com/pytorch/audio/issues/2194) is trying to do the same thing),...
### 🚀 The feature It would be very helpful to provide the following interface for the beamforming module ([torchaudio.transforms.MVDR](https://pytorch.org/audio/master/transforms.html#torchaudio.transforms.MVDR.forward)): `forward(specgram: torch.Tensor, psd_s: torch.Tensor, psd_n: torch.Tensor) → torch.Tensor` and...
### 🚀 The feature It looks like the spectrogram transform does not work with float16 precision. Versions: python 3.8, torch 1.10.1, torchaudio 0.10.1, typing-extensions 3.10.0.2; OS: ubuntu 20.04. My code to test...
Hello there, I want to run this TTS with the Bangla language. I have the dataset on my Google Drive. Can you tell me if I run your model in...
### 🚀 The feature GPU audio decoding at least for some codecs is useful for wider usage of compressed audio for training ASR models. Maybe some neural codecs (I think...
PyTorch core has a download function, `torch.hub.download_url_to_file`. Torchaudio can use it for dataset downloads and does not need to maintain its own `torchaudio.datasets.utils.download_url`. In addition to that, there seems to...
## 🚀 Feature The current version of istft enforces a NOLA check, so some kinds of padding (e.g. asymmetrical padding or #427) will raise an assertion error. I would like to see...
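For readers unfamiliar with the NOLA (nonzero overlap-add) condition mentioned in this request, here is a minimal sketch of the idea in plain Python. It is not the check `torch.istft` performs internally; the helper names `hann` and `satisfies_nola` and the tolerance value are hypothetical, and only the steady-state part of the signal is examined, so edge effects from padding are ignored.

```python
import math


def hann(n_fft):
    """Periodic Hann window of length n_fft (illustrative helper)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n_fft) for i in range(n_fft)]


def satisfies_nola(window, hop_length, tol=1e-10):
    """Return True if the overlap-added squared window never drops to ~0.

    In steady state the envelope is periodic with period hop_length, so it is
    enough to sum window[offset], window[offset + hop], ... for each offset.
    The tolerance is an assumed value, not one taken from any library.
    """
    for offset in range(hop_length):
        envelope = sum(w * w for w in window[offset::hop_length])
        if envelope <= tol:
            return False
    return True


print(satisfies_nola(hann(8), hop_length=2))  # overlapping frames cover every sample
print(satisfies_nola(hann(8), hop_length=8))  # no overlap: the window's zero sample is never covered
```

With a hop equal to the window length, the periodic Hann window's first sample is zero and nothing else overlaps it, which is the kind of situation a NOLA check rejects.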
PyTorch v1.8.2 LTS has been released. The corresponding `torchaudio` version is presumably `v0.8.2`, but there exists no such official release. This has been an issue for us, [as commented here](https://github.com/pytorch/pytorch.github.io/issues/828#issuecomment-941370876)....
### 🐛 Describe the bug Hello! I tried using `torchaudio.functional.filtfilt`, which was created in #1681, instead of `scipy.signal.filtfilt`. In my observation, there is no combination of arguments for which the filtered output is...
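As background for comparisons like this one, the following is a minimal plain-Python sketch of forward-backward (zero-phase) filtering. It is neither the torchaudio nor the SciPy implementation; `lfilter` and `filtfilt_no_padding` are illustrative names, and the sketch deliberately applies no edge padding. One thing worth ruling out when comparing against `scipy.signal.filtfilt` is that SciPy pads the signal edges by default (`padtype='odd'`), which a plain forward-backward pass like this one does not do.

```python
def lfilter(b, a, x):
    """Direct-form difference equation: a[0]*y[n] = sum(b[k]*x[n-k]) - sum(a[k]*y[n-k])."""
    y = []
    for n in range(len(x)):
        acc = 0.0
        for k, bk in enumerate(b):
            if n - k >= 0:
                acc += bk * x[n - k]
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * y[n - k]
        y.append(acc / a[0])
    return y


def filtfilt_no_padding(b, a, x):
    """Filter forward, then filter the reversed result, then reverse again.

    The two passes cancel each other's phase delay. No edge padding is
    applied here (an assumption of this sketch).
    """
    forward = lfilter(b, a, x)
    backward = lfilter(b, a, forward[::-1])
    return backward[::-1]


# A two-tap moving average applied to a centered impulse gives a symmetric response.
print(filtfilt_no_padding([0.5, 0.5], [1.0], [0, 0, 1, 0, 0]))
```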
It was mentioned in #1059 that it might be good to support a narrow range of frequencies to find the spectral centroid. This is an attempt at that. Apologies if...
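To illustrate what restricting the spectral centroid to a frequency range could look like, here is a small plain-Python sketch. It is not the code from this pull request; the function name `band_limited_centroid` and the parameters `f_min` and `f_max` are hypothetical, and the centroid is assumed to be the magnitude-weighted mean of the bin frequencies.

```python
def band_limited_centroid(freqs, magnitudes, f_min=0.0, f_max=float("inf")):
    """Magnitude-weighted mean frequency over bins with f_min <= f <= f_max.

    Returns NaN when the selected band carries no energy.
    """
    weighted_sum = 0.0
    total = 0.0
    for f, m in zip(freqs, magnitudes):
        if f_min <= f <= f_max:
            weighted_sum += f * m
            total += m
    if total == 0.0:
        return float("nan")
    return weighted_sum / total


freqs = [0.0, 100.0, 200.0, 300.0, 400.0]   # bin center frequencies in Hz
mags = [1.0, 2.0, 2.0, 0.0, 5.0]            # magnitudes for a single frame

print(band_limited_centroid(freqs, mags))                 # centroid over all bins
print(band_limited_centroid(freqs, mags, 100.0, 300.0))   # centroid over 100-300 Hz only
```

Bins outside the band are simply excluded from both sums, so strong energy at 400 Hz no longer pulls the centroid upward in the second call.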