Adding columns to understand Model Performance

Open trivikramak opened this issue 1 year ago • 5 comments

Can we add some metrics or a relative scoring mechanism or something like that to understand how good the models are?

trivikramak avatar Jan 30 '24 11:01 trivikramak

@Pendrokar is doing that here: https://github.com/Vaibhavs10/open-tts-tracker/pull/19

Vaibhavs10 avatar Jan 30 '24 11:01 Vaibhavs10

The "Real-time factor below threshold" column serves as that relative score for CPU. For GPU acceleration it would be too hard to tell. It is best to visit the repo of the specific TTS, or test the 🤗 Space, and see if they provide the data.

Pendrokar avatar Jan 30 '24 17:01 Pendrokar

@trivikramak The pull request adding the capability table has been accepted. Feel free to correct any information in it with a PR. I judged the capabilities from HF Spaces. If a processor cell is empty, the RTF was low with CUDA, but I could not determine whether it would have been as low with just the CPU. https://github.com/Vaibhavs10/open-tts-tracker/blob/main/README.md#capability-specifics

Pendrokar avatar Mar 11 '24 11:03 Pendrokar

@fakerybakery I noticed you've added CPU as the processor for StyleTTS in the capabilities table, but when I cloned its Space on HF, I got an RTF of around 7.0. 🤔 Admittedly, I chose the "RTF below 2.0" condition arbitrarily, since any TTS is capable of processing the audio on a CPU. The question is whether there is a point in including those whose RTF is 5, 10, or 20 on CPU. If so, all cells would have CPU as the processor... @Vaibhavs10 thoughts?

I see an RTF of 2.0 as a good target, since a TTS with streaming support would be able to play back the audio once it gets past the halfway point of processing it.
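
The halfway-point reasoning can be checked with a little arithmetic. The sketch below is illustrative only. It assumes RTF means processing time divided by audio duration (lower is faster), which matches how it is used in this thread, and that the model generates audio at a constant rate. The function names are hypothetical and not part of any TTS library.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF under the assumed definition: processing time / audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def min_playback_delay(rtf: float, audio_seconds: float) -> float:
    """Shortest wait before starting playback so it never outruns generation.

    Assumes a constant generation rate. Playback of the full clip ends at
    delay + audio_seconds, and generation ends at rtf * audio_seconds, so
    the delay must be at least (rtf - 1) * audio_seconds.
    """
    return max(0.0, (rtf - 1.0) * audio_seconds)


def fraction_processed_before_playback(rtf: float) -> float:
    """Share of total processing that must finish before playback starts."""
    if rtf <= 1.0:
        return 0.0  # generation keeps up with playback from the start
    return (rtf - 1.0) / rtf


if __name__ == "__main__":
    for rtf in (0.5, 2.0, 7.0):
        share = fraction_processed_before_playback(rtf)
        print(f"RTF {rtf}: wait for {share:.0%} of processing before playback")
```

Under these assumptions, an RTF of 2.0 gives a share of exactly one half, which is the halfway point mentioned above. An RTF of 7.0 would require about 86% of the processing to finish first, so streaming would help very little.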

Pendrokar avatar Mar 16 '24 23:03 Pendrokar

@Pendrokar I think it also depends on which CPU you're using. On a MacBook I get much higher speeds; free HF Spaces have quite a basic CPU.
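
Since the result depends on the hardware, one way to compare numbers is to measure RTF locally with the same input text. The following is a minimal sketch, not a benchmark from this repository. `synthesize` is a hypothetical stand-in for whatever inference call a given TTS exposes, assumed here to return a sequence of samples and a sample rate. RTF is again taken as processing time divided by audio duration.

```python
import time
from typing import Callable, Sequence, Tuple


def measure_rtf(
    synthesize: Callable[[str], Tuple[Sequence[float], int]],
    text: str,
    timer: Callable[[], float] = time.perf_counter,
) -> float:
    """Time one synthesis call and return processing time / audio duration.

    `synthesize` is a placeholder for a model-specific call; it is assumed
    to return (samples, sample_rate). `timer` is injectable for testing.
    """
    start = timer()
    samples, sample_rate = synthesize(text)
    elapsed = timer() - start

    audio_seconds = len(samples) / sample_rate
    if audio_seconds <= 0:
        raise ValueError("synthesizer returned no audio")
    return elapsed / audio_seconds


if __name__ == "__main__":
    # Dummy synthesizer: sleeps briefly and returns one second of silence.
    def fake_tts(text: str) -> Tuple[Sequence[float], int]:
        time.sleep(0.05)
        return [0.0] * 16000, 16000

    print(f"RTF: {measure_rtf(fake_tts, 'hello world'):.2f}")
```

A single run is noisy, and the first call often includes model loading, so averaging several runs after a warm-up call would give a fairer figure.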

fakerybakery avatar Mar 16 '24 23:03 fakerybakery