audio issues

Build failure for v0.10.2 in nvidia/cuda:11.3.1-cudnn8-devel-ubuntu20.04

17

## Repro Instructions: ``` git clone --recursive https://github.com/pytorch/audio.git cd audio git checkout v0.10.2 export USE_CUDA=1 export BUILD_SOX=1 CC=gcc-9 CXX=g++-9 python3.8 setup.py bdist_wheel python3.8 -m pip install dist/*.whl python3.8 -c 'import...

Lokiiiiii

Bark Filterbank for torchaudio

4

### 🚀 The feature Is there any plan/interest to enable Bark spectrogram calculation in torchaudio? ### Motivation, pitch More flexibility to users of torchaudio especially for ML-DSP purposes ### Alternatives...

ahmed-fau

new feature

Fbank features are different from Kaldi Fbank

11

## 🐛 Bug The output of the fbank feature calculations differs from that of kaldi. ## To Reproduce Steps to reproduce the behavior: using the following or even the defaults...

jooan84

Add training recipes for HuBERT model pre-training and ASR fine-tuning

### 🚀 The feature [Hidden-Unit BERT (HuBERT)](https://arxiv.org/pdf/2106.07447.pdf?fbclid=IwAR3hI4uGqc4mV5j-ob8R5yLu-BaamVoe9ncxUoVmgFLjJXsE1IevP0rdNYY), a self-supervised model for speech representations was proposed and wildly used in down-stream tasks, such as speech recognition, speech diarization, speaker identification, etc....

nateanl

[Migration] Torchaudio Complex Tensor Support and Migration

4

# Torchaudio Complex Tensor Support and Migration ## Overview `torchaudio` has been expressing complex numbers by having an extra dimension for real-part and imaginary-part. (We will refer this format as...

mthrok

complex dtype migration

Remove use-conda-forge-if-py-310 assumptions

This sounds like a wrong and spurious dependency, python-3.10 is available in conda

malfet

cla signed

ciflow/default

Inconsistent behavior of mu-law encode/decode

2

The docstrings of `mu_law_encoding` and `mu_law_decoding` say the following. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L491-L492 And at the beginning of the operation, it performs conversion for non-real-float Tensor into real float. https://github.com/pytorch/audio/blob/a8fbbdac1de389cf7799b0abd0684f9b77c2f675/torchaudio/functional/functional.py#L502-L503 Note: `is_floating_point()` return...

mthrok

Added example for ComputeDeltas

Example code for [ComputeDeltas](https://github.com/pytorch/audio/blob/94f4ef0fb4a7bdb58122ddad204ecc9a697c4098/torchaudio/transforms.py#L936) as mentioned in [this issue](https://github.com/pytorch/audio/issues/1564)

radioactive11

cla signed

ciflow/default

Re-enable subprocess TorchBind op test

This commit reverts 05ae795a

mthrok

cla signed

ciflow/default

torchaudio v0.9.0 installation error (protobuf not found)

3

Hi, after installing PyTorch 1.9.0, I tried to install torchaudio v0.9.0 (release/v0.9) from source on a ppc64le machine. However, I received an error that CMake is unable to locate protobuf....

JeromeNi

audio
audio copied to clipboard

Metadata

Build failure for v0.10.2 in nvidia/cuda:11.3.1-cudnn8-devel-ubuntu20.04

Bark Filterbank for torchaudio

Fbank features are different from Kaldi Fbank

Add training recipes for HuBERT model pre-training and ASR fine-tuning

[Migration] Torchaudio Complex Tensor Support and Migration

Remove use-conda-forge-if-py-310 assumptions

Inconsistent behavior of mu-law encode/decode

Added example for ComputeDeltas

Re-enable subprocess TorchBind op test

torchaudio v0.9.0 installation error (protobuf not found)

← Metadata

Owner

Metadata

audio audio copied to clipboard

Metadata

← Metadata

Owner

Metadata

audio
audio copied to clipboard