Jing Tang comments

Results 8 comments of


                                            Jing Tang

先选择文件，后修改 data，data值没有添加上去

> inputFile (newFile, oldFile) { if (newFile && oldFile) { oldFile.data = selfData (需要上传的参数) } } 为什么赋值后oldFile.data 是undefined

先选择文件，后修改 data，data值没有添加上去

@lian-yue 我也遇上了这样一个问题，多个文件上传时，每个文件都要根据上个文件上传完毕的response值来更新其：data值然后在上传。现在是如果一起选择多个文件，和文件一起发送的data值都是一样的，没法动态更新，可以帮忙解答一下这个问题吗，谢谢。

Comparing WhisperX and Faster-Whisper on RunPod: Speed, Accuracy, and Optimization

whisperX has way better timestamp alignment than faster-whisper.

【BUG】字幕识别10-20%出现遗漏，视频里部分字幕识别不出来的问题。

> 这是 demucs 进行人声分离导致的 whisper 识别错误率升高的问题，这一点目前暂时没有完美的方案，如果不进行人声分离有时候会因为背景声导致 whisperX 的时间轴对齐出现严重错误。我在whisper discussion看到这个 https://github.com/openai/whisper/discussions/29 不知道是不是和原生wisper使用的VAD的参数有关，下面有些解决办法，可以试一下

【BUG】字幕识别10-20%出现遗漏，视频里部分字幕识别不出来的问题。

> 这个错误现在发现了，是 demucs 处理音频后声音会变小，影响 whisperX 的转录过程，现在 v2.0.4 会调大音量再送入转录代码见 [c1de9c9](https://github.com/Huanshere/VideoLingo/commit/c1de9c9d9428c8821b88f7120ac70fbd1d7f13af) 谢谢试用了下，调大音量后enhanced_vocals.wav的文件比原文件vocals.wav size要小？请问demucs之前是用什么库分离人声，我怎么感觉那个分离效果要好很多？

Just use this script to make the srt more readable for the end results. almost perfect, try it and share your thoughts.

> [@search620](https://github.com/search620) have you tried https://github.com/awerks/whisperX/blob/main/whisperx/SubtitlesProcessor.py? > > Another one I saw is [#829 (comment)](https://github.com/m-bain/whisperX/issues/829#issuecomment-2207004192) someone already tried both methods? which one is better,? using deepmultilingualpunctuation, or Spacy Fragmenter