audio issues

Add manifold support

2

Differential Revision: D32078361

mthrok

fb-exported

cla signed

ciflow/default

other

InverseMelScale does not work in inference mode

11

`InverseMelScale` uses SGD internally, so it does not work when the global context is `no_grad` or `inference_mode`. Even `requires_grad=False` would make it fail. This gives bad UX for...

mthrok

Issues with transforms.InverseMelScale

6

Hi! I have had some issues using InverseMelScale. Firstly, I used the transform on a Spectrogram, without taking the log or using AmplitudeToDB on the spectrogram. This resulted in very...

jacobjwebber

Define TORCHAUDIO_API for better visibility control

3

## Context: TorchAudio uses dual-binding (PyBind11 and TorchBind) to make custom operations available in Python. Both bindings eventually call the same implementation contained in `libtorchaudio[_XXX].so`. The ones bound via...

mthrok

CLA Signed

BE hackathon

Change the path handling in fluent speech command dataset

The paths listed in the dataset's CSV file are POSIX-style, so they need OS-agnostic handling to work on Windows.

mthrok

cla signed
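A minimal sketch of the kind of OS-agnostic handling described in the entry above, using only the Python standard library. The helper name `resolve_csv_path` and the sample path are hypothetical and are not taken from torchaudio; the idea is to parse the CSV entry with POSIX rules and then re-join its components with the host's native path type.

```python
from pathlib import Path, PurePosixPath


def resolve_csv_path(root, posix_relpath):
    """Join a POSIX-style relative path (as stored in a CSV) onto a local root.

    Hypothetical helper for illustration; not part of torchaudio.
    """
    # Split the entry with POSIX rules, regardless of the host OS.
    parts = PurePosixPath(posix_relpath).parts
    # Re-join the components with the native path flavor (Windows or POSIX).
    return Path(root).joinpath(*parts)


# Sample path is made up for the example.
resolved = resolve_csv_path("data", "wavs/speakers/example/clip.wav")
print(resolved)
```

Splitting into components before joining avoids hard-coding `/` or `\` as a separator, so the same code should behave consistently on Windows and on POSIX systems.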

Cleanup conda channel flags, make sure we can switch easily between pytorch-nightly, pytorch-test and pytorch

### 🐛 Describe the bug Clean up the conda channel flags so that we can switch easily between pytorch-nightly, pytorch-test and pytorch. We have the following logic in torchaudio and torchtext: https://github.com/pytorch/audio/blob/main/packaging/pkg_helpers.bash#L210 There...

atalman

Exporting the operator stft to ONNX opset version 9 is not supported.

21

Hi, I tried exporting the feature extraction process to ONNX:

```
import torch
import torchaudio

model = torchaudio.transforms.MelSpectrogram()
x = torch.randn(1, 16000)
torch.onnx.export(model, x, 'tmp.onnx', input_names=['input'], output_names=['output'])
```

and...

lawlict

Add pretrained weights from Voxpopuli

2

[VoxPopuli](https://github.com/facebookresearch/voxpopuli) publishes pre-trained models for many different languages under the [CC BY-NC 4.0](https://github.com/facebookresearch/covost/blob/main/LICENSE) license. We can add them to torchaudio. ## non-fine-tuned weights https://github.com/facebookresearch/voxpopuli#wav2vec-20 - [ ] es - base -...

mthrok

[NOT MERGE] Test

atalman

cla signed

RNN Transducer Loss

1

This issue is to track the follow-up work to #1137, which introduced `rnnt_loss` and `RNNTLoss` as a [prototype](https://pytorch.org/audio/stable/index.html) in `torchaudio.prototype.transducer` using [HawkAaron's warp-transducer](https://github.com/HawkAaron/warp-transducer). - Update documentation - [ ] Guard...

vincentqb
