transformers.js-examples
transformers.js-examples copied to clipboard
Whisper word level timestemp
I am wondering if it is possible to have word level timestamp when using the whisper example? I tried to include the param word_timestamps to the model options but it didn't work.
Any help in how to achieve this will be very welcome. Thanks
They don't appear to work with the distil models. the old xenova ones seem fine though less accurate for me than the sample for the distilled version here https://huggingface.co/spaces/Xenova/distil-whisper-web but it only gives sentence timestamps
@danieloi I managed to do it here https://github.com/apssouza22/video-text-edit
@apssouza22 it works for word-level timestamps when you use the distil variants of the model?
Oh sorry, distilled didn't work, but I thought it was because the models are too big. @danieloi
Same here @apssouza22, I get this error when I want word-level timestamps and use the distil variants in transformers.js:
Error: Layer index 6 is out of bounds for cross attentions (length 4).
at webpack://@huggingface/transformers/./src/models.js:3498:1
at Array.map (
@danieloi yeah. Got the same. Let me know if you manage to get it sorted