DiffusionVideoEditing

preprocessing scripts

Open azuredsky opened this issue 2 years ago • 7 comments

Good job! Could you upload your data preprocessing scripts?

azuredsky avatar Mar 27 '23 08:03 azuredsky

Really looking forward to your further code.

sdulyq avatar Mar 28 '23 03:03 sdulyq

+1. I am trying to train the Single Speaker Model. I've downloaded the identity S1 data from the GRID corpus, but I find that it cannot run without the right input. Could you upload the data preprocessing scripts?

baiyuting avatar Apr 26 '23 03:04 baiyuting

Hi Guys,

Yes, I am submitting a revised version of the manuscript next week, and will also make everything public that day, including all datasets and scripts.

@baiyuting Don't bother training the single speaker model yet. The updated version I have will give you much better results, and the way I do the audio conditioning in the current implementation is out of date. I will also be providing a multi-speaker model that works pretty well on unseen subjects.

Thank you so much for your patience. It's taken me a bit longer than expected to get the new paper ready.

DanBigioi avatar Apr 26 '23 14:04 DanBigioi

OK @DanBigioi, looking forward to the new version of the code next week.

baiyuting avatar Apr 27 '23 01:04 baiyuting

Just a heads up, guys: I've uploaded everything you need to start training, fine-tuning, and running inference. I'll also make some tutorial videos tomorrow to help explain the code.

DanBigioi avatar May 11 '23 18:05 DanBigioi

@azuredsky @baiyuting @sdulyq

DanBigioi avatar May 11 '23 18:05 DanBigioi

@DanBigioi Thank you very much, this is the true spirit of open source on the Internet, and my friend, you are a true hero.

sdulyq avatar Jun 14 '23 01:06 sdulyq