audio issues

Biquad functions not seemingly compatible with autocast with bfloat16

### 🐛 Describe the bug

I attempted to use highpass_biquad to calculate a loss while inside of an autocast block:

```
self.floatFormat = torch.bfloat16 if torch.cuda.is_bf16_supported() else torch.float32
...
with...
```

pokepress
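
For orientation, a high-pass biquad is a second-order recursive filter whose output depends on two past inputs and two past outputs. The sketch below is plain Python, not torchaudio's code, and it does not reproduce the autocast failure. It assumes the commonly used Audio EQ Cookbook coefficient formulas; the names `highpass_coeffs` and `biquad` are hypothetical.

```python
import math


def highpass_coeffs(sample_rate, cutoff_freq, q=0.707):
    """Return normalized (b, a) coefficients for a high-pass biquad.

    Assumption: Audio EQ Cookbook formulas, normalized so that a[0] == 1.
    """
    w0 = 2.0 * math.pi * cutoff_freq / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    cos_w0 = math.cos(w0)
    a0 = 1.0 + alpha
    b = [
        (1.0 + cos_w0) / 2.0 / a0,
        -(1.0 + cos_w0) / a0,
        (1.0 + cos_w0) / 2.0 / a0,
    ]
    a = [1.0, -2.0 * cos_w0 / a0, (1.0 - alpha) / a0]
    return b, a


def biquad(samples, b, a):
    """Apply the difference equation sample by sample (direct form I)."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x0 in samples:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0  # shift the input history
        y2, y1 = y1, y0  # shift the output history
    return out


# A constant (DC) input should be removed by a high-pass filter.
b, a = highpass_coeffs(sample_rate=16000, cutoff_freq=1000)
filtered = biquad([1.0] * 2000, b, a)
```

Because each output feeds back into later outputs, rounding errors can accumulate over the length of the signal. Whether that is related to the reported bfloat16 behavior is not stated in the issue.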

[binary builds] torchaudio Windows cuda nightly builds are failing since 01.29.2025

### 🐛 Describe the bug

Windows nightly builds are broken: [hud.pytorch.org/hud/pytorch/audio/nightly/1?per_page=50&name_filter=windows](https://hud.pytorch.org/hud/pytorch/audio/nightly/1?per_page=50&name_filter=windows)

Workflow: [pytorch/audio/actions/runs/13030436154/job/36348301796#step:12:3346](https://github.com/pytorch/audio/actions/runs/13030436154/job/36348301796#step:12:3346)

Error:

```
2025-01-29T11:58:41.0321201Z C:/actions-runner/_work/_temp/conda_environment_13030436154/Lib/site-packages/torch/include\torch/csrc/dynamo/compiled_autograd.h(962): error C2872: 'std': ambiguous symbol
```

The failure is related to this PR: https://github.com/pytorch/pytorch/pull/144707...

atalman

fix: fixing batching in MMS_FA

3

Fixing batching in MMS_FA bundle

ex3ndr

Avoid torch.jit.isinstance to support torch.compile(MFCC(), fullgraph=True)

3

The following code fails on main but works in this PR.

```py
from torchaudio.transforms import MFCC
import torch

mfcc = torch.compile(MFCC(), fullgraph=True)
```

Unfortunately, the compiled model can still not...

twoertwein

CLA Signed

Torchaudio package on conda-forge is missing dynamic libs

### 🐛 Describe the bug

Related [issue is here](https://github.com/conda/conda/issues/14497). The package on conda-forge (`linux-64/torchaudio-2.5.1-cpu_py312h6db6e6d_0.conda`) is missing the ffmpeg dynamic libs, so it is not possible to use it. Please fix this package....

andrew-aladjev

SQUIM running in real-time

I applied SQUIM to assess speech quality as a way to correct the direction-of-arrival of a location-based speech enhancement system. [More info here](https://www.sciencedirect.com/science/article/pii/S1051200424005840). I'm feeding the last 3-second window of...

balkce
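
Keeping "the last 3-second window" of a stream can be sketched with a fixed-size buffer. This is a minimal illustration, not the poster's code: the `RollingWindow` class is hypothetical, the 16 kHz sample rate is an assumption, and the quality model itself is left out.

```python
from collections import deque


class RollingWindow:
    """Keep only the most recent `seconds` of audio samples."""

    def __init__(self, sample_rate=16000, seconds=3.0):
        # Assumed sample rate; use whatever rate the stream actually has.
        self.capacity = int(sample_rate * seconds)
        # A deque with maxlen drops the oldest samples automatically.
        self._buf = deque(maxlen=self.capacity)

    def push(self, chunk):
        """Append a chunk of new samples from the stream."""
        self._buf.extend(chunk)

    def ready(self):
        """True once a full window has been collected."""
        return len(self._buf) == self.capacity

    def window(self):
        """Return the current window, oldest sample first."""
        return list(self._buf)
```

Each time a new chunk arrives, the caller would `push` it and, once `ready()` is true, pass `window()` to the quality estimator.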

Add instruction to implement audio augmentation using torchaudio.sox…

2

…_effects PLEASE NOTE THAT THE TORCHAUDIO REPOSITORY IS NO LONGER ACTIVELY MONITORED. You may not get a response. For open discussions, visit https://discuss.pytorch.org/.

RosieLe12

Add support for Pseudo Quadrature Mirror Filter

1

### 🚀 The feature

Add the ability to apply PQMF to a signal and the inverse transform to reconstruct the signal.

### Motivation, pitch

PQMF is used, for example, in Multi-band MelGAN and DurIAN....

uis246

TypeError: unsupported operand type(s) for //: 'NoneType' and 'int'

### 🐛 Describe the bug

I am trying to convert Nvidia NeMo's FilterbankFeaturesTA class to ONNX. Here is my code -

```
from nemo.collections.asr.parts.preprocessing.features import (
    FilterbankFeatures,
    FilterbankFeaturesTA,
    make_seq_mask_like,
)
...
```

kabyanil
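
The message in the title is what Python raises when the left operand of `//` is `None`. The issue preview is truncated, so where the `None` originates is not shown here. The sketch below only reproduces the message and shows one way to fail earlier with a clearer error; `num_frames` and `safe_num_frames` are hypothetical helpers, not NeMo or torchaudio functions.

```python
def num_frames(length, hop_length):
    """Floor-divide a length by a hop size; fails if length is None."""
    return length // hop_length


def safe_num_frames(length, hop_length):
    """Same computation, but reject a missing length explicitly."""
    if length is None:
        raise ValueError("length must be a concrete int, got None")
    return length // hop_length


# Reproduce the error message from the issue title.
try:
    num_frames(None, 160)
except TypeError as exc:
    message = str(exc)
```

A guard like this does not fix the underlying cause, but it points at the argument that was left unset.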

ffmpeg 7

### 🚀 The feature

Hi, just interested in roughly when we will be supporting ffmpeg 7. It supports AV1 hardware encoding/decoding on 40 series or L4 GPUs.

### Motivation, pitch

just...

lee101
