audio
audio copied to clipboard
Data manipulation and transformation for audio signal processing, powered by PyTorch
### 🐛 Describe the bug I'm using [MLS](https://www.openslr.org/94/) in Opus. For example trying to load [11956_10613_000004.opus.tar.gz](https://github.com/user-attachments/files/16454390/11956_10613_000004.opus.tar.gz). Loading works fine for other datasets. ```python import torchaudio torchaudio.load("11956_10613_000004.opus") ``` ``` Traceback (most...
### 🐛 Describe the bug On a Jetson Orin Nano, applying the `transforms.MFCC` transform with certain parameters to samples from the `datasets.SPEECHCOMMANDS` dataset, results in some values in the resulting...
### 🐛 Describe the bug The following line in the functional method `loudness` results in `nan` value when the entire waveform is below the hardcoded loudness threshold value `gamma_abs =...
### 🐛 Describe the bug Using this code: ``` import torch from torchaudio.io import StreamReader video_path = stream_reader = StreamReader(video_path) stream_reader.add_video_stream(5, decoder= 'h264_cuvid', hw_accel='cuda') start_timestep = 10 stream_reader.seek(start_timestep) for (chunk,...
I was recently working with a project where some of the audio files wouldn't load properly at the end. Since the loading order is shuffled, it was difficult to tell...
### 🐛 Describe the bug Using `add_basic_video_stream` causes the last frame of a video to be erroneously dropped. first download [`example.mp4`](https://drive.google.com/file/d/16g1xVAyO-jaEVRmxG4UFow7Y3LWX-phR/view?usp=drive_link) (177KB). ```python import torio def read_video(file_path, frame_rate=25): reader =...