audio issues

Add StreamWriter tutorial

Add a tutorial for basic usage of torchaudio.io.StreamWriter.

mthrok

cla signed

Adopt :autosummary: to multiple modules

3

Adopt `:autosummary:` to various modules:
* torchaudio.compliance.kaldi
* torchaudio.sox_effects
* torchaudio.utils

mthrok

cla signed

RNNT Loss Accepting Log Probabilities

2

### 🚀 The feature
@carolineechen The current RNNT loss takes logits as inputs. I wonder if it is possible to have a version that takes log probabilities rather than logits....

BriansIDP
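To make the distinction in this request concrete, here is a minimal, library-free sketch of how logits relate to log probabilities through log-softmax. The function name `log_softmax` and the sample values are illustrative assumptions, not part of torchaudio. It also shows that applying log-softmax to values that are already log probabilities leaves them unchanged up to rounding.

```python
import math


def log_softmax(logits):
    """Convert unnormalized logits into log probabilities (illustrative helper)."""
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return [v - log_sum for v in logits]


# Hypothetical logits for a three-symbol vocabulary.
logits = [2.0, 1.0, 0.1]
log_probs = log_softmax(logits)

# Exponentiated log probabilities form a distribution that sums to 1.
total = sum(math.exp(lp) for lp in log_probs)

# Normalizing again changes nothing, because the log-sum-exp
# of valid log probabilities is log(1) = 0.
log_probs_again = log_softmax(log_probs)
```

This is only a sketch of the arithmetic. Whether a given loss implementation normalizes its inputs internally is something to confirm in that implementation's documentation.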

torchaudio.io._compat.load_audio_fileobj returns shorter waveform than torchaudio.load for certain FLAC files

7

### 🐛 Describe the bug
When using the two loading methods on the same audio file, the lengths of the waveform tensors are different. I can reproduce this issue with...

nateanl

StreamReader Failed to open the input io.BytesIO

8

### 🐛 Describe the bug
StreamReader fails to open an input `io.BytesIO` that was saved by `torchaudio.save`.
```python
import io
import torchaudio
from torchaudio.io import StreamReader

wav_file = "demo.wav"
streamer...
```

Jackiexiao

Add support for Modified Discrete Cosine Transform (MDCT)

4

### 🚀 The feature
The [Modified Discrete Cosine Transform (MDCT)](https://en.wikipedia.org/wiki/Modified_discrete_cosine_transform) is a perfectly invertible transform that can be used for feature extraction. It can be used as an alternative to...

Kinyugo
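As an illustration of the invertibility mentioned in this request, the following is a small pure-Python sketch of the textbook MDCT and inverse MDCT definitions, not an existing or proposed torchaudio API. The names `mdct`, `imdct`, and `mdct_roundtrip` are hypothetical. A single frame maps 2N samples to N coefficients and is not invertible on its own. The signal is recovered by overlap-adding the inverse transforms of frames that overlap by half (time-domain aliasing cancellation). The sketch assumes an unwindowed frame with a 1/N inverse scaling and zero padding at both ends.

```python
import math


def mdct(frame):
    """Direct O(N^2) MDCT: 2N input samples -> N coefficients."""
    n_half = len(frame) // 2
    return [
        sum(
            frame[n] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(2 * n_half)
        )
        for k in range(n_half)
    ]


def imdct(coeffs):
    """Inverse MDCT: N coefficients -> 2N time-aliased samples (1/N scaling)."""
    n_half = len(coeffs)
    return [
        sum(
            coeffs[k] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for k in range(n_half)
        )
        / n_half
        for n in range(2 * n_half)
    ]


def mdct_roundtrip(signal, n_half):
    """Transform half-overlapping frames and overlap-add the inverses."""
    # Pad so that every original sample is covered by exactly two frames.
    padded = [0.0] * n_half + list(signal) + [0.0] * n_half
    remainder = len(padded) % n_half
    if remainder:
        padded += [0.0] * (n_half - remainder)

    output = [0.0] * len(padded)
    for start in range(0, len(padded) - 2 * n_half + 1, n_half):
        frame = padded[start:start + 2 * n_half]
        for offset, value in enumerate(imdct(mdct(frame))):
            # Aliasing terms from neighbouring frames cancel in this sum.
            output[start + offset] += value
    return output[n_half:n_half + len(signal)]


# Arbitrary illustrative samples; hop size N = 4, frame length 2N = 8.
signal = [0.5, -1.0, 2.0, 3.5, -0.25, 1.0, 0.0, 4.0, -2.0, 1.5]
reconstructed = mdct_roundtrip(signal, 4)
max_error = max(abs(a - b) for a, b in zip(signal, reconstructed))
```

Practical codecs usually apply an analysis and synthesis window that satisfies the Princen-Bradley condition and use a fast algorithm rather than this direct summation. The unwindowed form is used here only to keep the sketch short.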

torchaudio.sox_effects.apply_effects_file failed to load from `.mp3` format

2

### 🐛 Describe the bug
Directly loading `.mp3` audio with `torchaudio.sox_effects.apply_effects_file` fails:
```python
import torchaudio

file = "clips/common_voice_id_25649986.mp3"
effects = [['speed', '0.9'], ['rate', '48000']]
torchaudio.sox_effects.apply_effects_file(file, effects)
# output:
#...
```

maxwellzh

Add unit test for LibriMix dataset

nateanl

cla signed

Increase timeout for the conda installs

1

Increase timeout for the conda installs. I am observing the following error on the Linux conda install:
```
Too long with no output (exceeded 10m0s): context deadline exceeded
```
Ref: https://app.circleci.com/pipelines/github/pytorch/audio/12563/workflows/a99a4f55-4006-406a-9d2a-89a24311aa0c/jobs/899681

atalman

cla signed

Add lintrunner

Re-using the same linter as currently used by PyTorch core

malfet

cla signed
