Yongrae Jo
There are other good argument-parser libraries as well, such as http://docopt.org/. We could generalize this issue by accepting already-parsed arguments in JSON format, instead of relying on one specific command-line format.
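As a rough illustration of that idea, here is a minimal sketch that fills a standard `argparse` namespace from a JSON object rather than from a command-line string. The helper name `parse_args_from_json` and the `--lr` / `--epochs` options are hypothetical and are not taken from the project under discussion; the sketch also assumes the parser has no required arguments.

```python
import argparse
import json


def parse_args_from_json(parser, json_text):
    """Build an argparse.Namespace from a JSON object (hypothetical helper)."""
    values = json.loads(json_text)
    if not isinstance(values, dict):
        raise ValueError("expected a JSON object mapping argument names to values")
    # Start from the parser's defaults (assumes no required arguments),
    # then override them with the values found in the JSON object.
    namespace = parser.parse_args([])
    known = vars(namespace)
    for key, value in values.items():
        if key not in known:
            raise ValueError(f"unknown argument: {key}")
        setattr(namespace, key, value)
    return namespace


# Illustrative options only; a real project would declare its own.
parser = argparse.ArgumentParser()
parser.add_argument("--lr", type=float, default=0.001)
parser.add_argument("--epochs", type=int, default=10)

args = parse_args_from_json(parser, '{"epochs": 3}')
print(args.lr, args.epochs)
```

With this approach, any front end (docopt, a config file, a web form) only needs to produce a JSON object, and the rest of the program keeps reading a plain namespace. Note that this sketch does not apply argparse's `type=` conversions to the JSON values.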
@antoyang To extract speech transcripts, I used the Whisper model (base). The ASR results seem okay, but I still cannot reproduce the results from the paper. For YouCook2, the highest score...
@antoyang Could you elaborate on the sentence segmentation tool? I used the ASR result from Whisper as is, like in the sample input below. It has start and end...
Hi, @thechargedneutron. I still cannot reproduce the results. With ASR results from Whisper I could improve the results a bit, but they were still far below the reported numbers...
@antoyang I tried WhisperX and the results got slightly better, but they are still far below the reported performance. (e.g., METEOR 4.3 on YouCook2 with Whisper -> 4.5 with WhisperX vs. 9.3...