Data manipulation and transformation for audio signal processing, powered by PyTorch

Results 324 audio issues

Added a LibriSpeech CTC recipe for demonstration purposes. - This recipe demonstrates using either [torch.nn.CTCLoss](https://pytorch.org/docs/stable/generated/torch.nn.CTCLoss.html) or [k2.ctc_loss](https://k2-fsa.github.io/k2/python_api/api.html#ctc-loss). Both can converge to similar results. - It supports using either CTC or...

CLA Signed

Hello all! Currently, TorchAudio doesn’t provide much support for music use cases. We’d like to gauge the community's interest in improving that support. Some requests we’ve received include...

RFC

### 🚀 The feature It would be nice if seek could go to a specific frame rather than the timestamp. ### Motivation, pitch I'm taking strided chunks from an audio...
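Until a frame-based seek exists, one possible workaround is to convert the frame index to a timestamp before seeking. The sketch below assumes that "frame" means an audio sample index and that the seek call takes a timestamp in seconds; `frame_to_timestamp` and `strided_chunk_starts` are hypothetical helper names, not part of TorchAudio.

```python
def frame_to_timestamp(frame_index: int, sample_rate: int) -> float:
    """Convert a sample (frame) index to a timestamp in seconds.

    Hypothetical helper: assumes one frame equals one audio sample.
    """
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    if frame_index < 0:
        raise ValueError("frame_index must be non-negative")
    return frame_index / sample_rate


def strided_chunk_starts(num_frames: int, chunk_size: int, stride: int) -> list[int]:
    """Return the start frame of every full chunk when striding over a signal."""
    if chunk_size <= 0 or stride <= 0:
        raise ValueError("chunk_size and stride must be positive")
    return list(range(0, num_frames - chunk_size + 1, stride))


# Timestamps (in seconds) to seek to for each strided chunk of a 16 kHz signal.
timestamps = [
    frame_to_timestamp(start, 16000)
    for start in strided_chunk_starts(num_frames=48000, chunk_size=16000, stride=8000)
]
```

Each resulting timestamp could then be passed to a timestamp-based seek. Whether the decoder lands exactly on the requested frame depends on the container and codec, which is the gap the feature request points at.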

### 🐛 Describe the bug Not a serious bug. Contrary to what the documentation says, I had to specify `format="rgb24"` to force the RGB colorspace. ### Versions ``` Collecting...

### 🐛 Describe the bug When building Torchaudio with rocm5.6.0, cmake couldn't find the path to rocrand ``` CMake Error at cmake/LoadHIP.cmake:138 (find_package): By not providing "Findrocrand.cmake" in CMAKE_MODULE_PATH this...

module: rocm

### 🐛 Describe the bug Issue with the HuBERT pre-training scripts in https://github.com/pytorch/audio/tree/main/examples/hubert: I am unable to resume the training by loading the latest "End of the epoch" checkpoints. I...

### 🐛 Describe the bug I also encountered the same problem as [2870](https://github.com/pytorch/audio/issues/2870), which caused a Floating point exception (core dumped) when loading ADPCM encoded audio and caused the service...

### 🐛 Describe the bug In an example of https://pytorch.org/audio/stable/transforms.html, `TimeStretch` takes arguments as follows: ```python TimeStretch(stretch_factor, fixed_rate=True) ``` This usage is incorrect. According to https://pytorch.org/audio/stable/generated/torchaudio.transforms.TimeStretch.html, this class takes the...

Ref https://github.com/pytorch/audio/issues/3317#issuecomment-1540433493 The GPU encoder used to accept RGB24, but after the recent refactoring in the main branch it expects RGBA32. An extra padding option to convert RGB24 to RGBA32 should be added...
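The padding described above amounts to appending a fourth byte to every 3-byte pixel. The following is a minimal pure-Python sketch of that idea, not the encoder's actual implementation; the function name `rgb24_to_rgba32`, the R, G, B, A byte order, and the default fill value of 255 are all assumptions made for illustration.

```python
def rgb24_to_rgba32(rgb: bytes, alpha: int = 255) -> bytes:
    """Pad packed RGB24 pixels to 4 bytes each by appending a constant byte.

    Hypothetical helper: assumes packed R, G, B input and R, G, B, A output.
    """
    if len(rgb) % 3 != 0:
        raise ValueError("RGB24 buffer length must be a multiple of 3")
    if not 0 <= alpha <= 255:
        raise ValueError("alpha must fit in one byte")
    num_pixels = len(rgb) // 3
    out = bytearray(num_pixels * 4)
    # Copy each colour channel into its slot in the 4-byte pixel layout.
    out[0::4] = rgb[0::3]
    out[1::4] = rgb[1::3]
    out[2::4] = rgb[2::3]
    # Fill the fourth byte of every pixel with the constant padding value.
    out[3::4] = bytes([alpha]) * num_pixels
    return bytes(out)


two_pixels = rgb24_to_rgba32(bytes([1, 2, 3, 4, 5, 6]))
```

On tensors the same effect would typically be achieved by concatenating a constant channel, which avoids the per-byte copies shown here.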

This is #3082 with an added `pytest` configuration as suggested in https://github.com/pmeier/pytest-results-action/issues/9#issuecomment-1573258305.

CLA Signed