UniAudio issues

SVS task recipe

SVS task recipe will be provided soon?

text output task

great work! I notice that it seems only audio generation tasks are supported, but some of tasks like `ASR` is defined in code, have you tried task for text-predict-task,thks!

HaiFengZeng

Recipe of TSE?

Nice work! Could you please also share your recipe of TSE?

wjyfelicity

inference TTS

1

Hi i have succeded to debug a few errors from the inference code. Can you please add the DecoderOnlyModel in model.py and what is phone.1.pt file that is required by...

pawanhv

running TTS egs 1. text_tokenizer.py https://github.com/yangdongchao/UniAudio/blob/b4da009653486828b4d71a6efe1772b1403ce324/UniAudio/tools/tokenizer/phone/text_tokenizer.py#L12 no TextTokenizer ``` class Text2PhoneTokenizer(AbsTokenizer): def __init__(self, langdir='UniAudio/checkpoints/lang_nosp'): super(TextTokenizer, self).__init__() ``` 2. offline_tokenization.py ``` 5 Traceback (most recent call last): 6 File "data_scripts/offline_tokenization.py", line...

Jackiexiao

License?

Can you please add a license?

kachiO

Training time

Amazing work! Could you please let me know more detailed training information such as the training time? Thanks.

huifu99

example of tts(zero shot) on libritts?

2

Hi, there are some examples of tts(zero shot) on libritts?

deyituo

Whether the absence of punctuation in TTS data will affect the experimental results.

Hello, dear author!I have a question to ask you for advice. During my preparing TTS experimental data (LibriLight for tts), I noticed that the transcribed results only consist of text...

Alidaling

The difference between AudioTokenizer and EncodecTokenizer?

3

I find 2 tokenizer models for audio, AudioTokenizer and EncodecTokenizer. In egs, tts, vc, and se all use tokenizer "audio". I guess these models are all based on SoundStream. What's...

chenxinglili

UniAudio
UniAudio copied to clipboard

Metadata

SVS task recipe

text output task

Recipe of TSE?

inference TTS

several bugs

License?

Training time

example of tts(zero shot) on libritts?

Whether the absence of punctuation in TTS data will affect the experimental results.

The difference between AudioTokenizer and EncodecTokenizer?

← Metadata

Owner

Metadata

UniAudio UniAudio copied to clipboard

Metadata

← Metadata

Owner

Metadata

UniAudio
UniAudio copied to clipboard