audio issues

Loading Opus files from MLS dataset fails because of file metadata

### 🐛 Describe the bug

I'm using [MLS](https://www.openslr.org/94/) in Opus. For example, I'm trying to load [11956_10613_000004.opus.tar.gz](https://github.com/user-attachments/files/16454390/11956_10613_000004.opus.tar.gz). Loading works fine for other datasets.

```python
import torchaudio

torchaudio.load("11956_10613_000004.opus")
```

```
Traceback (most...
```

niemiaszek

transforms.MFCC results in NaN values on Jetson Orin Nano

### 🐛 Describe the bug

On a Jetson Orin Nano, applying the `transforms.MFCC` transform with certain parameters to samples from the `datasets.SPEECHCOMMANDS` dataset results in some values in the resulting...

frmser

Replace runners prefix amz2023.

1

testing new runners

jeanschmidt

CLA Signed

Division by zero in loudness calculation

### 🐛 Describe the bug

The following line in the functional method `loudness` results in a `nan` value when the entire waveform is below the hardcoded loudness threshold value `gamma_abs =...

DanTremonti
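To illustrate the failure mode described in the loudness report above, here is a minimal pure-Python sketch of a gated mean. It is not torchaudio's implementation: the function names, the `floor` fallback, and the threshold value used in the usage note are all hypothetical. It only shows how an average over "blocks above a threshold" becomes `nan` when no block passes the gate, and one possible guard.

```python
import math


def gated_mean(energies, loudness, threshold):
    """Average the energies of blocks whose loudness exceeds `threshold`.

    Hypothetical helper, not a torchaudio function.
    """
    total = 0.0
    count = 0
    for energy, level in zip(energies, loudness):
        if level > threshold:
            total += energy
            count += 1
    # Tensor arithmetic turns 0.0 / 0 into NaN silently; plain Python would
    # raise ZeroDivisionError, so the NaN result is emulated explicitly here.
    return total / count if count else math.nan


def guarded_gated_mean(energies, loudness, threshold, floor=1e-10):
    """Same as gated_mean, but fall back to `floor` when no block passes.

    `floor` is an illustrative choice; a real fix might prefer to return
    negative infinity loudness or raise an error instead.
    """
    value = gated_mean(energies, loudness, threshold)
    return floor if math.isnan(value) else value
```

For example, with an illustrative threshold of `-70.0` (a value chosen for this sketch, not taken from the report), `gated_mean([1e-9, 2e-9], [-90.0, -87.0], -70.0)` is `nan`, while `guarded_gated_mean` with the same arguments returns the `floor` value.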

Replace runners prefix amz2023.

1

testing new runners

jeanschmidt

CLA Signed

Replace runners prefix amz2023.

1

testing new runners

jeanschmidt

CLA Signed

Video reading: torchaudio.io.StreamReader seek method returns the first frame, regardless of the input start_timestep (on version 0.13.1)

### 🐛 Describe the bug

Using this code:

```
import torch
from torchaudio.io import StreamReader

video_path =
stream_reader = StreamReader(video_path)
stream_reader.add_video_stream(5, decoder='h264_cuvid', hw_accel='cuda')
start_timestep = 10
stream_reader.seek(start_timestep)
for (chunk,...
```

StolikTomer

Use std::optional types

1

cyyever

CLA Signed

Loading failure errors should indicate what was being loaded when error occurred

I was recently working with a project where some of the audio files wouldn't load properly at the end. Since the loading order is shuffled, it was difficult to tell...

pokepress
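The request above (naming the file in a loading error) can be approximated on the caller's side today. The following is a minimal sketch under stated assumptions: `load_with_context` is a hypothetical wrapper, not part of torchaudio, and `loader` stands for any callable that takes a path as its first argument (for instance `torchaudio.load`).

```python
def load_with_context(loader, path, *args, **kwargs):
    """Call `loader(path, ...)` and attach the path to any failure.

    Hypothetical helper: `loader` is any callable taking a path first.
    """
    try:
        return loader(path, *args, **kwargs)
    except Exception as err:
        # Chain the original exception so its traceback is preserved.
        raise RuntimeError(f"Failed to load {path!r}: {err}") from err
```

With a shuffled loading order, the re-raised message then identifies the offending file, and the original exception stays available as `__cause__`.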

StreamReader.add_basic_video_stream drops last frame if `frame_rate` is specified

1

### 🐛 Describe the bug

Using `add_basic_video_stream` causes the last frame of a video to be erroneously dropped. First, download [`example.mp4`](https://drive.google.com/file/d/16g1xVAyO-jaEVRmxG4UFow7Y3LWX-phR/view?usp=drive_link) (177KB).

```python
import torio

def read_video(file_path, frame_rate=25):
    reader =...
```

tyler-rt
