audio icon indicating copy to clipboard operation
audio copied to clipboard

Deprecate data utils

Open mthrok opened this issue 4 years ago • 3 comments

PyTorch core has download function and torch.hub.download_url_to_file. Torchaudio can use it for dataset download and does not need to maintain its own torchaudio.datasets.utils.download_url.

In addition to that, there seems to be something wrong with torchaudio.datasets.utils.download_url. It fails to download multiple of huge files. I think we should get rid of torchaudio.datasets.utils.download_url and torchaudio.datasets.utils.stream_url.

Steps

Phase 1

  1. Replace the usages of torchaudio.datasets.utils.download_url in torchaudio with torch.hub.download_url_to_file and torchaudio.datasets.utils.validate_file
  2. Add deprecation warning to torchaudio.datasets.utils.download_url and torchaudio.datasets.utils.stream_url, saying it will be removed in v0.12.

Phase 2

Once the v0.11 is released and the main branch is bumped to v0.12, we can remove these functions.

Note

For setting up the development environment, please refer to CONTRIBUTING.md and torchaudio_unittest

mthrok avatar Oct 15 '21 21:10 mthrok

I am working on this issue

jdsgomes avatar Dec 14 '21 16:12 jdsgomes

@mthrok can I work on this issue?

radioactive11 avatar Jan 12 '22 06:01 radioactive11

Hi @radioactive11

Sorry for the late reply. The phase 1 is completed and we need to wait until the branch cut of v0.11 to perform the phase 2. I will update this once the branch cut happens.

mthrok avatar Jan 18 '22 16:01 mthrok