fish-speech issues

3080 TI and it's still slow even with --compile it takes like 120 seconds at least even with small text input

bug

Which will be better? with indices or with hiddens?

Hi, Is there any experiments about LLM training speech input? there are two kind of inputs: the indices of codebook in codec, as a singel integer value, or the indexed...

JohnHerry

[BUG] 包名错误

pip install audio-seperator 报错。库里是这样写的。 pip install audio-separator 是可以的。

shinoairisu

bug

The conflict is caused by: transformers 4.35.2 depends on tokenizers=0.14 faster-whisper 0.8.0 depends on tokenizers==0.13.* transformers 4.35.2 depends on tokenizers=0.14 faster-whisper 0.7.1 depends on tokenizers==0.13.* transformers 4.35.2 depends on tokenizers=0.14...

shinoairisu

bug

[Feature] How to support new languages

1

你好，我想训练一个法语的tts，不知道是否需要修改代码？如何修改可以支持。另外想咨询下大概需要多少小时的干声可以训练出来一个比较好的tts？这个tts是专有领域的（科技），不需要那么强的泛化。

ILG2021

enhancement

[BUG] API接口调用报错，都是500错误 Internal Server Error

1

![image](https://github.com/user-attachments/assets/06fdd6ba-0295-4bff-852b-65aa1ed12995)

zane8521

bug

Fix Import Path in tools/vqgan/inference.py

This pull request addresses an issue in tools/vqgan/inference.py where the import statement for AUDIO_EXTENSIONS was incorrect. The import statement was originally: ```python from fish_speech.utils.file import AUDIO_EXTENSIONS ``` It has been...

octree

Fix(deps): remove audio-seperator

**Is this PR adding new feature or fix a BUG?** Add feature / Fix BUG. **Is this pull request related to any issue? If yes, please link the issue.** #xxx

Stardust-minus

llama 训练速度

1

训练t2s的速度很慢，大约0.09it/s，我使用的GPU为8卡RTX A6000，batch size 为16，请问这个训练速度正常吗？我用lightning profiler统计了一下，在backward和step的时候耗时最长这个是用advanced分析的backward和step的结果 ``` Profile stats for: [Strategy]DDPStrategy.backward rank: 0 190 function calls (185 primitive calls) in 43.795 seconds Ordered by: cumulative time ncalls tottime percall...

dukGuo

enhancement

fish-speech
fish-speech copied to clipboard

Metadata

Fix:(req) pinned torch version to 2.3.1, avoid inference speed abnorm…

[BUG] 3080 TI and it's still slow even with --compile

Which will be better? with indices or with hiddens?

[BUG] 包名错误

[BUG] 按照linux方法安装报错

[Feature] How to support new languages

[BUG] API接口调用报错，都是500错误 Internal Server Error

Fix Import Path in tools/vqgan/inference.py

Fix(deps): remove audio-seperator

llama 训练速度

← Metadata

Owner

Metadata

fish-speech fish-speech copied to clipboard

Metadata

← Metadata

Owner

Metadata

fish-speech
fish-speech copied to clipboard