audio issues

StreamWriter The h264_nvenc/hevc_nvenc encoder supports the YUV420P format

4

### 🚀 The feature Is the module that utilizes nvenc for accelerated encoding considering support for the yuv420p format? ### Motivation, pitch I am using this module for accelerated video...

doraxcyle

unit tests Windows GPU failing

1

### 🐛 Describe the bug Following test are failing: https://app.circleci.com/pipelines/github/pytorch/audio/16491/workflows/e9d2d0de-56ba-42f8-804b-77bf26fa291f/jobs/1213093 unittest_windows_gpu_py3.8: ``` FAILED torchaudio_unittest\io\stream_reader_test.py::FilterGraphWithCudaAccel::test_scale_cuda_format - RuntimeError: Failed to create the filter from "scale_cuda=format=yuv444p" (Invalid argument.) FAILED torchaudio_unittest\io\stream_reader_test.py::FilterGraphWithCudaAccel::test_sclae_cuda_change_size - RuntimeError: Failed...

atalman

Add high-level IO function for image and video

1

Leveraging StreamReader we can load video and images. We should add functions `torchaudio.io.load_image` `torchaudio.io.load_video` `torchaudio.io.load_audio` which are thin wrapper around `StreamReader`. (and perhaps `save` versions)

mthrok

Update StreamReader tutorial

1

to include PTS and the difference of seek

mthrok

RuntimeError when loading large FLAC file

1

### 🐛 Describe the bug When using `torchaudio.load` on a 1.79 GB FLAC file, it throws `RuntimeError: Trying to create tensor with negative dimension -225262592: [-225262592, 2]`. ```python import json...

notadoktor

triaged

Add filter and filter_complex to StreamWriter

7

### 🚀 The feature Add the capability for ffmpeg filters (-filter, -filter_complex) in StreamWriter and StreamReader according to ffmpeg filters: https://ffmpeg.org/ffmpeg-filters.html It'll be good to add an argument to set...

maysteinfeld

module: IO

triaged

New `ffmpeg` backend changes samples when saving WAVE

2

### 🐛 Describe the bug Snippet to reproduce the error is provided below. Adding `backend="sox"` or `backend="soundfile"` to `torchaudio.save` removes the issue. ```python import os from tempfile import NamedTemporaryFile os.environ["TORCHAUDIO_USE_BACKEND_DISPATCHER"]...

pzelasko

triaged

List of feature requests received so far for StreamReader/Writer

Here is the list of feature requests for StreamReader/Writer I have received so far. Feel free to add 1. [x] PTS support in StreamWriter #3135 When processing videos/audios, with StreamReader/Writer,...

mthrok

module: IO

triaged

torchaudio.functional.resample() doesn't work with complex types

2

### 🐛 Describe the bug `torchaudio.functional.resample()` doesn't work with complex types. Example: ``` x = torch.randn(1024, 2) x = torch.view_as_complex(x) y = torchaudio.functional.resample(x, 1.0, 2.0) ``` Gives error: ``` RuntimeError:...

pfeatherstone

triaged

Additional spectral feature transformations

8

While torchaudio provides a Mel-scaled spectrogram transformation (`torchaudio.transforms.MEL`), there’re a few additional spectral feature transformations that are extremely useful for pre-processing and data augmentation. For example, two feature transformations that...

0x00b1

audio
audio copied to clipboard

Metadata

StreamWriter The h264_nvenc/hevc_nvenc encoder supports the YUV420P format

unit tests Windows GPU failing

Add high-level IO function for image and video

Update StreamReader tutorial

RuntimeError when loading large FLAC file

Add filter and filter_complex to StreamWriter

New `ffmpeg` backend changes samples when saving WAVE

List of feature requests received so far for StreamReader/Writer

torchaudio.functional.resample() doesn't work with complex types

Additional spectral feature transformations

← Metadata

Owner

Metadata

audio audio copied to clipboard

Metadata

← Metadata

Owner

Metadata

audio
audio copied to clipboard