audio-diffusion-pytorch icon indicating copy to clipboard operation
audio-diffusion-pytorch copied to clipboard

Adding dilation args for use with updated a-unet PR

Open jmoso13 opened this issue 2 years ago • 1 comments

This PR adds default args to the construction of the UNet to include dilation and dropout. Details on changes to underlying repo for this to work can be found here:

  • https://github.com/archinetai/a-unet/pull/4

jmoso13 avatar Jun 12 '23 22:06 jmoso13

Hi @jmoso13,

I'm curious about this PR. Beyond taking inspiration from the refinements in SoundStream and EnCodec, are there any specific improvements you've noticed with this addition? I'm intuitively guessing that it might be quicker/easier to get high-end detail and keep the audio clean, based the better coverage offered by dilation, but that's a super hand-wavey intuition... I'm also not super clear on how I'd try this out. Right now I'm using the UNetV0, as in the ADP readme examples.

Any thoughts appreciated!

jbmaxwell avatar Jul 12 '23 20:07 jbmaxwell