kiron111 issues

Results: 9 issues of kiron111

Can you add a function to transcribe multiple files, or even an entire folder?

Because I have to transcribe multiple audio/video files each time, it's inconvenient to click and wait for each file to finish. Thank you for your development of such...

When I transcribe Japanese video, sometimes the whole script file repeats the same dialogue

For example:

1
00:00:00,000 --> 00:00:02,000
I'm not sure if I'm going to be able to get through this.

2
00:00:02,000 --> 00:00:04,000
I'm not sure if I'm going to...

Will Cantonese be supported?

### Is your feature request related to a problem? As in the title, thank you. ### Describe the solution you'd like. _No response_ ### Describe alternatives you've considered. _No response_ ### Additional context. _No response_

enhancement

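Is it possible to skip creating a digital human and directly lip-sync audio to a video?

I just want to quickly lip-sync a short clip without having to train a voice every time. Please add a pure lip-sync feature. Thank you!!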

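Compositing with local photos causes problems

### Does a similar issue already exist? - [x] I have searched existing issues ### Current behavior I am on Windows 11, using the Docker deployment, version 1.2.5 (compositing with Pexels works). With images I upload myself, the program can generate the "video generated from image", but when it composites the complete silent clip combined-1.mp4, it says no clips were found to combine and then reports an error ### Expected behavior Normally combined-1.mp4 would be composited ### Steps to reproduce Use images I upload myself ### Stack trace/logs ``` ## preprocess local materials 2025-05-09 15:54:30.911...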

bug

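Add Kokoro-TTS API integration

### Does a similar feature request already exist? - [x] I have searched existing feature requests ### Pain point I suggest adding the ability to connect to the Kokoro-TTS API project https://github.com/PierrunoYT/Kokoro-TTS-Local This is probably the least hardware-demanding of the open-source TTS options: on a GPU it generates one minute of read-aloud audio in a few seconds, and on a CPU it only takes a bit longer (about the same as edge tts). It has natural rises and falls in intonation and does not sound very robotic (Chinese/Mandarin is supported) ### Proposed solution The project has a Gradio interface, so it should be callable via API; a standalone deployment would also work ### Useful resources https://github.com/PierrunoYT/Kokoro-TTS-Local ###...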

enhancement

[Feature]: Add the ability to upload your own voice audio

Suggestion: add an OpenAI-format API function to the app

Feature Request: add support for vision model InternVL3_5