parler-tts issues

Results 83 parler-tts issues

Sort by recently updated

run_parler_tts_training.py gives datasets.table.CastError error and failure

I've run through the steps to train a single voice, and it goes well until it comes time to actually **Fine-tuning Parler-TTS** step, i'm hitting a wall. It seems the...

duringleaves

Flash Attention Support

Is there any way the Flash Attention 2 support for this model? if there is a way to do it i would love to get involved and help out! I've...

sang-nguyen-ts

Fix typo in INFERENCE.md, change return_tensors and to correct usage of device

First of Kudos for such a brilliant project. I found a typo in the INFERENCE.md, that caused an issue. It referred to `device` rather than the `torch_device`.

unclecode

some question to prepare multilinguality training from scratch

congrats to release v2 parler-tts @sanchit-gandhi @ylacombe or anyone involve i am trying to explore reproduce multilinguality training, some question to ask if i want to train it multilingual 1....

acul3

Alltalk Integration & 1x question re voice consistency

Hi Congratulations on releasing the V1 update. I've integrated the v1 update into AllTalk v2 BETA https://github.com/erew123/alltalk_tts/tree/alltalkbeta If you are ever interested in making any updates to the setup for...

erew123

any list of all 36 voices?

just want a list

OpenMachinesAI

[Performance] Usage with `optimum-quanto`?

Hey there. I'm trying to use `parler-tts` for near-realtime text to speech, just fast enough for conversations, on CPU inference. I'm trying to quantize your model in int8 using the...

N3RDIUM

Using static cache, the inference time is high from the 4th run onwards

I have tested the static cache inference, but the results are not as expected. I observed that the first two runs are for warming up, torch compiling... The third run...

dongngm

Bug in generation code

At this [line](https://github.com/huggingface/parler-tts/blob/8b8c576e2dbdc29172e30be7d68fac9357cd92c5/parler_tts/modeling_parler_tts.py#L1430), a delay pattern mask is generated and applied to the initial audio IDs. Then, at this [line](https://github.com/huggingface/parler-tts/blob/8b8c576e2dbdc29172e30be7d68fac9357cd92c5/parler_tts/modeling_parler_tts.py#L1530), a mask is also generated to revert the delay on...

Squire-tomsk

parler-tts
parler-tts copied to clipboard

Metadata

run_parler_tts_training.py gives datasets.table.CastError error and failure

Flash Attention Support

GGML implementation when?

Fix typo in INFERENCE.md, change return_tensors and to correct usage of device

some question to prepare multilinguality training from scratch

Alltalk Integration & 1x question re voice consistency

any list of all 36 voices?

[Performance] Usage with `optimum-quanto`?

Using static cache, the inference time is high from the 4th run onwards

Bug in generation code

← Metadata

Owner

Metadata

parler-tts parler-tts copied to clipboard

Metadata

← Metadata

Owner

Metadata

parler-tts
parler-tts copied to clipboard