audio issues

error when building torchaudio v0.9.0 from source on ppc64le (power9)

8

### 🐛 Describe the bug Hi there, I am trying to build torchaudio v0.9.0 with pytorch 1.9 from source (looks like [this post](https://github.com/pytorch/audio/issues/2194) is trying to do the same thing),...

j4sonzhao

New interface for MVDR beamforming

8

### 🚀 The feature It would be very helpful to provide the following interface for the beamforming module ([torchaudio.transforms.MVDR](https://pytorch.org/audio/master/transforms.html#torchaudio.transforms.MVDR.forward)): `forward(specgram: torch.Tensor, psd_s: torch.Tensor, psd_n: torch.Tensor) → torch.Tensor` and...

Emrys365

Spectrogram transform with float16 precision

2

### 🚀 The feature Looks like the spectrogram transform does not work with float16 precision. Versions: python 3.8, torch 1.10.1, torchaudio 0.10.1, typing-extensions 3.10.0.2, OS: ubuntu 20.04. My code to test...

Guillaume-oso

how do i train the model for different language?

2

Hello there, I want to run this TTS with the Bangla language. I have the dataset on my Google Drive. Can you tell me if I run your model in...

FarhadGazi

[discussion] Batched CPU/GPU audio decoding / encoding

8

### 🚀 The feature GPU audio decoding at least for some codecs is useful for wider usage of compressed audio for training ASR models. Maybe some neural codecs (I think...

vadimkantorov

Deprecate data utils

3

PyTorch core has a download function, `torch.hub.download_url_to_file`. Torchaudio can use it for dataset downloads and does not need to maintain its own `torchaudio.datasets.utils.download_url`. In addition to that, there seems to...

mthrok

help wanted

good first issue

contributions welcome

Support custom padding in istft

6

## 🚀 Feature The current version of istft force-checks NOLA, so some kinds of padding (e.g. asymmetrical padding or #427) will raise an assertion error. I would like to see...

yuzhms

Publish an official release for torchaudio 0.8.2

7

PyTorch v1.8.2 LTS has been released. The corresponding `torchaudio` version is presumably `v0.8.2`, but there exists no such official release. This has been an issue for us, [as commented here](https://github.com/pytorch/pytorch.github.io/issues/828#issuecomment-941370876)....

gkowarzyk

filtfilt consistency with scipy

5

### 🐛 Describe the bug Hello! I tried using `torchaudio.functional.filtfilt`, created in #1681, instead of `scipy.signal.filtfilt`. In my observation there is no combination of arguments for which the filtered output is...

SolomidHero

Centroid frequency limits

5

It was mentioned in #1059 that it might be good to support a narrow range of frequencies to find the spectral centroid. This is an attempt at that. Apologies if...

jacobjwebber

cla signed

ciflow/default
