liuhuang31
@yl4579 Thank you for sharing so many ideas! Using the Aishell3 dataset, I can synthesize normal audio, and it sounds good. But when I generate audio for an unseen speaker, the timbre doesn't sound...
> I use pinyin for both ASR and StyleTTS, and it generates normal, good results.
> For Mandarin, I didn't use ipa_phonemes; I used pinyin initials and finals as the phonemes. 1. You can use pypinyin to generate pinyin. 2. The _initials and _finals used in pypinyin, then...
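The initial/final split described above can be sketched without any dependency. This is only an illustration, not the commenter's actual code: the `INITIALS` table, `split_syllable`, and `to_phonemes` are hypothetical names, the input is assumed to be tone-numbered pinyin syllables (for example as produced by pypinyin), and treating `y` and `w` as initials is an assumption that follows the non-strict convention.

```python
# Hypothetical initials table: standard pinyin initials, with the two-letter
# ones first so that "zh" is matched before "z". Treating "y" and "w" as
# initials is an assumption (non-strict convention).
INITIALS = [
    "zh", "ch", "sh",
    "b", "p", "m", "f", "d", "t", "n", "l",
    "g", "k", "h", "j", "q", "x", "r", "z", "c", "s",
    "y", "w",
]


def split_syllable(syllable):
    """Split one tone-numbered syllable, e.g. 'zhong1', into (initial, final)."""
    for initial in INITIALS:
        if syllable.startswith(initial):
            return initial, syllable[len(initial):]
    # Zero-initial syllable such as 'an1': the whole syllable is the final.
    return "", syllable


def to_phonemes(syllables):
    """Flatten a list of syllables into an initial/final phoneme sequence."""
    phonemes = []
    for syllable in syllables:
        initial, final = split_syllable(syllable)
        if initial:
            phonemes.append(initial)
        if final:
            phonemes.append(final)
    return phonemes


print(to_phonemes(["zhong1", "guo2"]))
```

Keeping the tone digit attached to the final is one possible design choice; a separate tone token would work as well, as long as the symbol table used for training matches.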
> Sorry, I forgot to reply. I didn't change anything in the models.
> > Sorry, I forgot to reply. I didn't change anything in the models. > Hi liuhuang31, how did you train the Chinese pinyin...
> Hello, pypinyin does not perform well in some cases, so I use a different phoneme set from the one in pypinyin. In that case, how can I prepare the filelists and how to...
@wangchao112211 Hello, did you manage to solve this? What was the cause?
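@FireRedTeam The recognition result is correct; it is just slow. Single-utterance recognition: other utterances have an RTF of 0.03, but this one has an RTF of 0.3. Batch recognition: without this utterance the batch RTF is 0.03; with it, the batch RTF is 0.3, a 10x slowdown.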
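For reference, RTF (real-time factor) is processing time divided by audio duration, so 0.3 s of compute for 10 s of audio gives 0.03. The sketch below shows one way to measure it per utterance and flag outliers like the one reported above. The helper names and the 5x-median threshold are assumptions made for illustration, and the recognizer call itself is left out because its exact signature is not shown here.

```python
import statistics
import time


def real_time_factor(processing_seconds, audio_seconds):
    """RTF = time spent decoding / duration of the audio that was decoded."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def time_call(fn, *args, **kwargs):
    """Run fn and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def find_slow_utterances(rtf_by_uttid, factor=5.0):
    """Return ids whose RTF exceeds `factor` times the median RTF.

    The threshold is an arbitrary choice for this sketch.
    """
    median = statistics.median(rtf_by_uttid.values())
    return [uttid for uttid, rtf in rtf_by_uttid.items() if rtf > factor * median]


# Example with made-up ids, using the RTF values quoted in the report.
rtfs = {"utt_a": 0.03, "utt_b": 0.03, "utt_c": 0.3}
print(find_slow_utterances(rtfs))
```

Timing each utterance on its own before batching helps separate a genuinely slow input from padding overhead, since a batch is typically padded to its longest utterance.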
I call it directly from Python, roughly like this: `from fireredasr.models.fireredasr import FireRedAsr import json import os import pandas as pd from datetime import datetime input_dir = "/root/storage/liuhuang.lh/workspace/nlp_llm/FireRedASR/0219_audiobbok" output_jsonfile = "/root/storage/liuhuang.lh/workspace/nlp_llm/FireRedASR/0219_audiobbok/audiobook_60.csv" wav_lists = os.listdir(input_dir) batch_uttid = [] batch_wav_path...