audio
audio copied to clipboard
Data manipulation and transformation for audio signal processing, powered by PyTorch
### š Describe the bug -- i understand i still have to respond to my PR on kernel creation speed (sorry about that!) - but I found another problem when...
### š The feature scipy.signal.lfilter supports an initial condition zi. This is critical for dealing with streaming inputs where we get one sample at a time. This feature is currently...
### š The feature Hello, Thanks for the handy tools for parsing the database! I am wondering if it is possible to let the torchaudio.dataset classes have two modes: 1....
# Request For Comment: The future of Kaldi-compatible features ## Problems `torchaudio.compliance.kaldi` implements functionalities that tries to reproduce Kaldi's feature extractions, and this module has many issues, and causing headache...
### š Describe the bug I've found that the output of the wav2vec2 pipeline model is bugged, and changes depending on the zero-padding used in batch preprocessing, a simple example...
### š Describe the bug I am trying to convert an audio tensor into 'gsm' format to simulate communication process with this apply_codec function. However, beside a transformed tensor, this...
### š The feature Iād like to propose the integration of tree-constrained pointer generator (TCPGen) [1] and Minimum Biasing Word Error (MBWE) training [2] for contextual biasing into torchaudio package....
Added HIPIFY code and small changes for ROCm. Targeting RNN-T loss.
### š The doc issue 1. How do we build our own language model and add it to the language model, such as wav2vec2? However many of the solutions from...
### š Describe the bug First, download a `wav` and a `mp3` file: ``` wget https://filesamples.com/samples/audio/wav/sample3.wav wget https://filesamples.com/samples/audio/mp3/sample3.mp3 ``` Here is a short repro: ```python import torchaudio # try reading...