ukemamaster
### Describe the bug
I have fine-tuned the XTTS v2 model on my own data, containing both long and short audios (with the following histogram showing duration in seconds on...
@joonson To train a binary classifier (having 2 speakers in the entire data), what should the values of `max_seg_per_spk`, `nClasses`, `nPerSpeaker`, and `batch_size` be? I have been trying...
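The parameter names in the question above come from a speaker-recognition trainer; as a hedged back-of-the-envelope sketch (the exact sampler behaviour is an assumption, not verified against the repo), the usual constraint is that each batch draws `nPerSpeaker` segments from `batch_size` *distinct* speakers, so with only 2 speakers `batch_size` cannot exceed 2:

```python
# Hypothetical sizing check for a 2-speaker (binary) setup.
# Assumption: the sampler builds each batch from `batch_size` distinct
# speakers, taking `nPerSpeaker` segments from each.
n_speakers = 2                    # binary task: 2 speakers in the data
nClasses = n_speakers             # one softmax class per speaker
nPerSpeaker = 2                   # segments per speaker (pairwise losses need >= 2)
batch_size = min(2, n_speakers)   # cannot exceed the number of distinct speakers
max_seg_per_spk = 500             # illustrative cap on segments used per speaker

segments_per_batch = batch_size * nPerSpeaker
print(segments_per_batch)  # → 4
```

The values above are illustrative only; the real limits depend on how the trainer's sampler groups utterances.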
Hi, nice work and congratulations on your paper. Do you plan to open-source the code in the near future?
## What
In the re-cutting stage I would like an option to specify a minimum audio length, because the re-cut audios are very, very short and...
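Until such an option exists, a minimal workaround is to post-filter the re-cut segments by duration. The sketch below is hypothetical: the function name and the `(start, end)` segment format are assumptions for illustration, not any tool's actual API.

```python
def filter_short_segments(segments, min_len=1.0):
    """Drop (start, end) segments shorter than min_len seconds.

    `segments` is a list of (start, end) pairs in seconds; the format
    is assumed here and may differ from the tool's real output.
    """
    return [(start, end) for start, end in segments if end - start >= min_len]

# Example: only the 2-second segment survives a 1-second minimum.
print(filter_short_segments([(0.0, 0.4), (1.0, 3.0), (3.1, 3.5)], min_len=1.0))
# → [(1.0, 3.0)]
```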
Is it possible to use the `musicgen-melody` model in the [Transformers library](https://github.com/huggingface/transformers) like the [`musicgen-small` model](https://github.com/facebookresearch/audiocraft/blob/main/docs/MUSICGEN.md#-transformers-usage)? I gave it a try:

```python
from transformers import AutoProcessor, MusicgenForConditionalGeneration
processor...
```
@hmartiro In interpolation, I always get repetitive music every 5 seconds, even with your seed images from the Hugging Face repo. Any tips to avoid this?
Hi @hmartiro, could you please explain how you generated the seed images? Are they simply spectrograms of music audios, or was some pre- or post-filtering applied? When I use...
Hi @hmartiro. Could you please confirm whether this app works in interpolation mode or in simple (text-to-audio) mode? The simple (text-to-audio) mode generates 5.12 (for...
Hi @OlaWod, I appreciate your work. I am trying to fine-tune the FreeVC model on my custom multilingual data (using an already-trained speaker encoder model), and without SR...
Hi @joonson, could you please give some hints on making it work for multi-node, multi-GPU distributed training?