Matteo Fasulo
Matteo Fasulo
Refactored iterate_result function to include a new subtitle_format option that modifies the style of subtitles. The function now iterates through the subtitles and formats the subtitle text based on the...
Adapt the work of @duozokker into the WebUI ("reddit" page) allowing the user to modify the json file (as for main video.json file) **Expected behavior** User should be able to...
### System Info Hi, using transformers.js v3 I encountered the following issue with Whisper model. The issue is already visible at https://github.com/xenova/whisper-web/issues/41#issue-2368797701 The current pipeline does not support retrieving word-level...
### Feature request New feature using VAD for silence suppression. A better description can be found at https://github.com/jianfch/stable-ts?tab=readme-ov-file#silence-suppression ### Motivation Current Whisper implementation fails to capture silences in word-level settings...
### System Info Using transformers.js v3 in latest Chrome release on Windows 10. GPU: Nvidia GTX 1080 (8GB) ### Environment/Platform - [X] Website/web-app - [ ] Browser extension - [...
### Feature request Hello, I am currently using the `_decode_asr` function in your ASR decoding library (Whisper). This function provides an option to return either word-level or sentence-level (chunk-level) timestamps,...