EmotiVoice Suggestion

Suggestion

Open lucasjinreal opened this issue 1 year ago • 4 comments

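The generated phoneme text does not include the speaker, the emotion, or the original content, and then direct inference slices each line and ends in an index error. Either provide a single script that generates audio directly from a txt file, or, if the process is split into two steps, have the first step generate everything. The two steps should not be logically inconsistent with each other.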
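To illustrate the failure mode: a minimal sketch of a pre-inference check, assuming each input line is expected to hold four `|`-separated fields (speaker, emotion prompt, phonemes, original content). The field layout, the `parse_line` helper, and the sample values are assumptions for illustration, not the project's actual code.

```python
# Assumed field layout of one inference input line (not confirmed by the repo).
EXPECTED_FIELDS = ("speaker", "prompt", "phonemes", "content")


def parse_line(line: str, lineno: int) -> dict:
    """Split one input line and fail with a readable message instead of an IndexError."""
    parts = line.rstrip("\n").split("|")
    if len(parts) != len(EXPECTED_FIELDS):
        raise ValueError(
            f"line {lineno}: expected {len(EXPECTED_FIELDS)} '|'-separated fields "
            f"({', '.join(EXPECTED_FIELDS)}), got {len(parts)}"
        )
    # Pair each field name with its stripped value.
    return dict(zip(EXPECTED_FIELDS, (p.strip() for p in parts)))
```

A line that contains only phonemes would then be rejected up front with a message naming the missing fields, rather than failing later when a field is indexed.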

Nov 14 '23 02:11 lucasjinreal

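My guess is that the front-end prediction module has not been released. The provided text contains sp2 annotations, but they do not exist after conversion to phonemes, so they provided a compromise.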

Nov 14 '23 11:11 wang-b22

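It has nothing to do with prediction. The txt should have at least a few speakers and emotions hardcoded; otherwise users are left confused, and the README has no explanation of this either.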
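Something like the following is what I mean by hardcoded defaults: a rough sketch, assuming the inference input is one line per utterance with four `|`-separated fields (speaker, emotion prompt, phonemes, original content). The layout, the default values, and the `build_inference_lines` helper are all hypothetical and only there to show the idea.

```python
DEFAULT_SPEAKER = "8051"   # hypothetical speaker ID, replace with a real one
DEFAULT_EMOTION = "Happy"  # hypothetical emotion prompt


def build_inference_lines(texts, phonemes,
                          speaker=DEFAULT_SPEAKER, emotion=DEFAULT_EMOTION):
    """Combine original text and its phonemes into complete input lines."""
    if len(texts) != len(phonemes):
        raise ValueError("texts and phonemes must have the same number of entries")
    # Assumed field order: speaker | emotion | phonemes | original content.
    return [f"{speaker}|{emotion}|{ph}|{tx}" for tx, ph in zip(texts, phonemes)]
```

With defaults like these, a user who only has plain text and its phonemes would still end up with complete lines, and could override the speaker or emotion per call.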

Nov 14 '23 11:11 lucasjinreal

Apologies for any confusion caused. We will proceed by adding additional documentation and potentially making modifications to the code.

Nov 15 '23 02:11 syq163

I think using sentiment analysis to determine the emotion automatically would also work.

Nov 15 '23 14:11 shirubei