Ideas for integration

Open RustoMCSpit opened this issue 1 year ago • 6 comments

  • [x] https://github.com/rakuri255/UltraSinger
  • [x] https://github.com/spotify/basic-pitch
  • [ ] https://github.com/IAHispano/Applio
  • [x] https://github.com/blaisewf/rvc-cli
  • [ ] https://github.com/stemrollerapp/stemroller
  • [ ] https://github.com/Frikallo/MISST

Vocoders:

VC Clients:

Pitch Extractors:

Other:

Edited by rsxdalv: turn list into a task list

RustoMCSpit avatar Dec 28 '24 03:12 RustoMCSpit

Thank you!

I checked these, and they seem to be good:

https://github.com/rakuri255/UltraSinger https://github.com/spotify/basic-pitch

Reasons:

  • Good license
  • Well known company
  • Good chance of future updates
  • New feature
  • Well written code / easy to integrate

I haven't had the chance to go through all of them, but, for example, this one is very low priority: https://github.com/blaisewf/rvc-cli

Reasons:

  • Non-commercial license
    • So it cannot be integrated directly
    • So people will not make improvements to it
    • So it provides a very limited improvement
  • The RVC technology is already partially integrated into the UI
  • Low popularity: less popular projects are less likely to improve the overall adoption/situation of this tool, unless they are quite good on their own.

rsxdalv avatar Dec 29 '24 15:12 rsxdalv

https://github.com/DamRsn/NeuralNote

this uses basicpitch

RustoMCSpit avatar Dec 29 '24 16:12 RustoMCSpit

@rsxdalv

https://github.com/multimodal-art-projection/YuE https://github.com/deepbeepmeep/YuEGP

MUST SEE

RustoMCSpit avatar Feb 08 '25 21:02 RustoMCSpit

@rsxdalv ???

RustoMCSpit avatar Apr 07 '25 10:04 RustoMCSpit

YuE has multiple layers of issues that make it a pain to integrate. For example, it uses a modified vocos that interferes with the existing vocos library. The installation also requires cloning a 1.75 GB repository, and it's not clear how soon any fixes would go out of date. In reality, quite a few of these projects have massive software engineering issues, though some are exceptionally difficult to integrate.

rsxdalv avatar Apr 11 '25 19:04 rsxdalv
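
To illustrate the kind of clash described above, here is a rough sketch that reports where a module name would be imported from in the current environment. It assumes the conflict happens at the import-name level (`vocos`); how YuE actually ships its modified copy is not confirmed here, and `module_origin` is a hypothetical helper, not part of any of these projects.

```python
from importlib import util
from typing import Optional


def module_origin(module_name: str) -> Optional[str]:
    """Return the file a top-level module would be loaded from, or None if absent.

    Hypothetical helper for illustration only.
    """
    spec = util.find_spec(module_name)
    if spec is None:
        return None
    return spec.origin


if __name__ == "__main__":
    # "vocos" is the import name assumed here; a non-None result means a copy
    # is already importable, so a second, modified copy would shadow or clash with it.
    origin = module_origin("vocos")
    if origin is None:
        print("vocos is not importable in this environment")
    else:
        print(f"vocos would be imported from: {origin}")
```

Running this before and after installing a project makes it visible when the import path of an existing library has changed.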

Next, YuE does not seem to want to work without flash-attention which is hard to install for Windows users.

rsxdalv avatar Apr 11 '25 19:04 rsxdalv
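
For context on the flash-attention dependency, a minimal sketch of how an integration could detect whether the package is installed before deciding what to do. It assumes the import name is `flash_attn` and uses the `attn_implementation` strings from Hugging Face transformers (`"flash_attention_2"`, `"sdpa"`); whether YuE can actually run with a non-flash backend is not established in this thread, and `preferred_attention` is a hypothetical helper.

```python
from importlib import util


def preferred_attention(module_name: str = "flash_attn") -> str:
    """Pick an attention backend name based on whether a module is importable.

    Hypothetical helper: returns "flash_attention_2" when `module_name` can be
    found, otherwise "sdpa" (PyTorch scaled dot-product attention).
    """
    if util.find_spec(module_name) is not None:
        return "flash_attention_2"
    return "sdpa"


if __name__ == "__main__":
    # The result could be passed as `attn_implementation=...` to a transformers
    # `from_pretrained` call, assuming the model code supports that backend.
    print(preferred_attention())
```

On a Windows machine without flash-attention this would report `sdpa`, which at least lets the UI show a clear message instead of failing during model load.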