audio
audio copied to clipboard
Data manipulation and transformation for audio signal processing, powered by PyTorch
## Repro Instructions: ``` git clone --recursive https://github.com/pytorch/audio.git cd audio git checkout v0.10.2 export USE_CUDA=1 export BUILD_SOX=1 CC=gcc-9 CXX=g++-9 python3.8 setup.py bdist_wheel python3.8 -m pip install dist/*.whl python3.8 -c 'import...
### 🚀 The feature Is there any plan/interest to enable Bark spectrogram calculation in torchaudio? ### Motivation, pitch More flexibility to users of torchaudio especially for ML-DSP purposes ### Alternatives...
## 🐛 Bug The output of the fbank feature calculations differs from that of kaldi. ## To Reproduce Steps to reproduce the behavior: using the following or even the defaults...
### 🚀 The feature [Hidden-Unit BERT (HuBERT)](https://arxiv.org/pdf/2106.07447.pdf?fbclid=IwAR3hI4uGqc4mV5j-ob8R5yLu-BaamVoe9ncxUoVmgFLjJXsE1IevP0rdNYY), a self-supervised model for speech representations was proposed and wildly used in down-stream tasks, such as speech recognition, speech diarization, speaker identification, etc....
# Torchaudio Complex Tensor Support and Migration ## Overview `torchaudio` has been expressing complex numbers by having an extra dimension for real-part and imaginary-part. (We will refer this format as...
This sounds like a wrong and spurious dependency, python-3.10 is available in conda
The docstrings of `mu_law_encoding` and `mu_law_decoding` say the following. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L491-L492 And at the beginning of the operation, it performs conversion for non-real-float Tensor into real float. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L502-L503 Note: `is_floating_point()` return...
Example code for [ComputeDeltas](https://github.com/pytorch/audio/blob/94f4ef0fb4a7bdb58122ddad204ecc9a697c4098/torchaudio/transforms.py#L936) as mentioned in [this issue](https://github.com/pytorch/audio/issues/1564)
Hi, after installing PyTorch 1.9.0, I tried to install torchaudio v0.9.0 (release/v0.9) from source on a ppc64le machine. However, I received an error that CMake is unable to locate protobuf....