audio icon indicating copy to clipboard operation
audio copied to clipboard

Data manipulation and transformation for audio signal processing, powered by PyTorch

Results 324 audio issues
Sort by recently updated
recently updated
newest added

## Repro Instructions: ``` git clone --recursive https://github.com/pytorch/audio.git cd audio git checkout v0.10.2 export USE_CUDA=1 export BUILD_SOX=1 CC=gcc-9 CXX=g++-9 python3.8 setup.py bdist_wheel python3.8 -m pip install dist/*.whl python3.8 -c 'import...

### 🚀 The feature Is there any plan/interest to enable Bark spectrogram calculation in torchaudio? ### Motivation, pitch More flexibility to users of torchaudio especially for ML-DSP purposes ### Alternatives...

new feature

## 🐛 Bug The output of the fbank feature calculations differs from that of kaldi. ## To Reproduce Steps to reproduce the behavior: using the following or even the defaults...

### 🚀 The feature [Hidden-Unit BERT (HuBERT)](https://arxiv.org/pdf/2106.07447.pdf?fbclid=IwAR3hI4uGqc4mV5j-ob8R5yLu-BaamVoe9ncxUoVmgFLjJXsE1IevP0rdNYY), a self-supervised model for speech representations was proposed and wildly used in down-stream tasks, such as speech recognition, speech diarization, speaker identification, etc....

# Torchaudio Complex Tensor Support and Migration ## Overview `torchaudio` has been expressing complex numbers by having an extra dimension for real-part and imaginary-part. (We will refer this format as...

complex dtype migration

This sounds like a wrong and spurious dependency, python-3.10 is available in conda

cla signed
ciflow/default

The docstrings of `mu_law_encoding` and `mu_law_decoding` say the following. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L491-L492 And at the beginning of the operation, it performs conversion for non-real-float Tensor into real float. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L502-L503 Note: `is_floating_point()` return...

Example code for [ComputeDeltas](https://github.com/pytorch/audio/blob/94f4ef0fb4a7bdb58122ddad204ecc9a697c4098/torchaudio/transforms.py#L936) as mentioned in [this issue](https://github.com/pytorch/audio/issues/1564)

cla signed
ciflow/default

This commit reverts 05ae795a

cla signed
ciflow/default

Hi, after installing PyTorch 1.9.0, I tried to install torchaudio v0.9.0 (release/v0.9) from source on a ppc64le machine. However, I received an error that CMake is unable to locate protobuf....