Matteo Fasulo issues

Results 6 issues of


                                            Matteo Fasulo

Add subtitle format and font color in word_options dict

Refactored iterate_result function to include a new subtitle_format option that modifies the style of subtitles. The function now iterates through the subtitles and formats the subtitle text based on the...

[WebUI] Reddit video generation

Adapt the work of @duozokker into the WebUI ("reddit" page) allowing the user to modify the json file (as for main video.json file) **Expected behavior** User should be able to...

bug

enhancement

help wanted

WebGPU pipeline supporting word-level timestamps #41

### System Info Hi, using transformers.js v3 I encountered the following issue with Whisper model. The issue is already visible at https://github.com/xenova/whisper-web/issues/41#issue-2368797701 The current pipeline does not support retrieving word-level...

bug

Add Whisper Voice Activity Detector (VAD) or Silero VAD for silence suppression

### Feature request New feature using VAD for silence suppression. A better description can be found at https://github.com/jianfch/stable-ts?tab=readme-ov-file#silence-suppression ### Motivation Current Whisper implementation fails to capture silences in word-level settings...

enhancement

[Severe] Memory leak issue under WebGPU Whisper transcribe pipeline

### System Info Using transformers.js v3 in latest Chrome release on Windows 10. GPU: Nvidia GTX 1080 (8GB) ### Environment/Platform - [X] Website/web-app - [ ] Browser extension - [...

bug

Support for Both Word-Level and Sentence-Level Timestamps in ASR Decoding

### Feature request Hello, I am currently using the `_decode_asr` function in your ASR decoding library (Whisper). This function provides an option to return either word-level or sentence-level (chunk-level) timestamps,...

enhancement