Deprecate data utils
PyTorch core already has a download function, torch.hub.download_url_to_file.
Torchaudio can use it for dataset download and does not need to maintain its own torchaudio.datasets.utils.download_url.
In addition, there seems to be something wrong with torchaudio.datasets.utils.download_url.
It fails when downloading multiple huge files. I think we should get rid of torchaudio.datasets.utils.download_url and torchaudio.datasets.utils.stream_url.
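For illustration, here is a rough sketch of what a dataset download could look like when built on torch.hub.download_url_to_file. The helper names fetch and sha256_of are hypothetical and not part of torchaudio, and the hashlib-based check is only a stand-in for whatever validation the dataset code ends up using.

```python
import hashlib
from pathlib import Path
from typing import Optional


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fetch(url: str, dst: str, expected_sha256: Optional[str] = None) -> str:
    """Download url to dst with torch.hub, then optionally verify the checksum.

    Hypothetical helper for illustration only.
    """
    # Imported lazily so the checksum helper works without PyTorch installed.
    import torch

    # Skip the download if the file is already present.
    if not Path(dst).exists():
        torch.hub.download_url_to_file(url, dst, progress=True)
    if expected_sha256 is not None and sha256_of(dst) != expected_sha256:
        raise RuntimeError(f"Checksum mismatch for {dst}")
    return dst
```

Reading the file in chunks keeps memory use flat for large archives, which matters for the huge files mentioned above.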
Steps
Phase 1
- Replace the usages of torchaudio.datasets.utils.download_url in torchaudio with torch.hub.download_url_to_file and torchaudio.datasets.utils.validate_file.
- Add a deprecation warning to torchaudio.datasets.utils.download_url and torchaudio.datasets.utils.stream_url, saying they will be removed in v0.12.
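The deprecation warning step could be sketched roughly as below. This is only an assumption about the shape of the change, not the actual patch: the deprecated decorator is a hypothetical helper, and download_url here is a placeholder body standing in for the real function.

```python
import functools
import warnings


def deprecated(replacement: str, version: str = "0.12"):
    """Build a decorator that warns on every call to the wrapped function.

    Hypothetical helper for illustration only.
    """

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            warnings.warn(
                f"{func.__name__} is deprecated and will be removed in "
                f"v{version}. Use {replacement} instead.",
                DeprecationWarning,
                stacklevel=2,  # attribute the warning to the caller
            )
            return func(*args, **kwargs)

        return wrapper

    return decorator


@deprecated("torch.hub.download_url_to_file")
def download_url(url: str, download_folder: str) -> str:
    # Placeholder body; the real function performs the download.
    return f"{download_folder}/{url.rsplit('/', 1)[-1]}"
```

Note that Python hides DeprecationWarning by default outside of __main__ and test runners, so a more visible category may be preferable if end users are expected to see the message.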
Phase 2
Once v0.11 is released and the main branch is bumped to v0.12, we can remove these functions.
Note
For setting up the development environment, please refer to CONTRIBUTING.md and torchaudio_unittest.
I am working on this issue
@mthrok can I work on this issue?
Hi @radioactive11
Sorry for the late reply. Phase 1 is complete, and we need to wait until the v0.11 branch cut before starting Phase 2. I will update this issue once the branch cut happens.