Rotem Dan comments

Results: 235 comments of Rotem Dan

Error with whisper large model

I uploaded the model when I tried adding support for the large Whisper models, especially V3. The lack of support for the large models was also reported earlier, on an...

Error with whisper large model

In `v1.0.0`, which I just released, I added support for [`whisper.cpp`](https://github.com/ggerganov/whisper.cpp), which supports large models (`large-v1`, `large-v2`, `large-v3`), quantized models, and GPU acceleration (CUDA and OpenCL). All `whisper.cpp` models (including quantized...

Error with whisper large model

Large models can't be fully supported until `onnxruntime-node` receives support for models over 2GB, with external data. The `whisper.cpp` engine supports them (all of its models, including quantized ones, are supported)....

Problems aligning certain Japanese characters with Whisper engine

Thanks for the report. The Whisper tokenization is based on a pure JavaScript implementation I made myself, and uses a standard GPT2 tokenizer. This isn't exactly what is used on...

Problems aligning certain Japanese characters with Whisper engine

In the newly released `1.0.x` version, I've integrated the `tiktoken` tokenizer. It can deal with many more types of characters, including some very obscure emojis. The tokenizer implementation is now...

Small issue regarding error message ISO 639-2 codes and `Echogarden.recognize`

Thanks. Two-letter language codes are used throughout all synthesis and recognition operations, I believe. The error message makes it look like the language isn't supported. I should change it...

Small issue regarding error message ISO 639-2 codes and `Echogarden.recognize`

I believe this issue was resolved a long time ago. Language codes are normalized, and multiple language code formats (including language names like "French") are accepted (documentation may still...
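To illustrate the kind of normalization described above, here is a minimal sketch in Python. It is not Echogarden's actual implementation: the function name `normalize_language_code` and the small `_ALIASES` table are hypothetical, and a real implementation would cover the full ISO 639 set rather than three languages.

```python
# Hypothetical alias table mapping two-letter codes, three-letter codes,
# and English language names to a two-letter code. Illustration only.
_ALIASES = {
    "en": "en", "eng": "en", "english": "en",
    "fr": "fr", "fra": "fr", "fre": "fr", "french": "fr",
    "ja": "ja", "jpn": "ja", "japanese": "ja",
}


def normalize_language_code(value: str) -> str:
    """Return a two-letter code for a code, locale tag, or language name."""
    # Ignore case and surrounding whitespace; treat "en_US" like "en-US".
    cleaned = value.strip().lower().replace("_", "-")
    # Drop any region subtag: "fr-fr" -> "fr".
    primary = cleaned.split("-", 1)[0]
    try:
        return _ALIASES[primary]
    except KeyError:
        raise ValueError(f"Unrecognized language: {value!r}") from None
```

With a table like this, `"French"`, `"fra"`, and `"fr-FR"` all resolve to `"fr"`, and an unknown input raises an error that names the rejected value instead of implying the language is unsupported.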

SMIL output format for the "alignment" command

Yes, now I remember looking into SMIL and EPUB 3 in the past. Seems like the links on the W3C site are mostly broken, including the tutorials. The last update...

SMIL output format for the "alignment" command

Thanks for all the details! One question I have is: who would this tool be intended for? For the average user, it isn't really that easy to give a seamless...

SMIL output format for the "alignment" command

The difficulty I described was in getting the word highlighting working correctly in the DOM within the browser extension; it wasn't about the accuracy of the timing. For synthesized text,...
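For readers unfamiliar with the format discussed in this thread, the following is a rough sketch of how word timings could be written out as SMIL in the style of EPUB 3 Media Overlays. It is an assumption-laden illustration, not the output of the "alignment" command: the function `timeline_to_smil`, the input tuple layout, and the file names are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by EPUB 3 Media Overlay documents.
SMIL_NS = "http://www.w3.org/ns/SMIL"


def timeline_to_smil(words, audio_src, text_src):
    """Build a SMIL string from (fragment_id, start_seconds, end_seconds) tuples.

    Each tuple becomes one <par> pairing a text fragment with an audio clip.
    """
    smil = ET.Element("smil", {"xmlns": SMIL_NS, "version": "3.0"})
    body = ET.SubElement(smil, "body")
    for fragment_id, start, end in words:
        par = ET.SubElement(body, "par")
        # Points at an element id inside the (hypothetical) XHTML document.
        ET.SubElement(par, "text", {"src": f"{text_src}#{fragment_id}"})
        ET.SubElement(par, "audio", {
            "src": audio_src,
            "clipBegin": f"{start:.3f}s",
            "clipEnd": f"{end:.3f}s",
        })
    return ET.tostring(smil, encoding="unicode")


# Example input: two words with made-up timings and file names.
example = timeline_to_smil(
    [("w1", 0.0, 0.42), ("w2", 0.42, 0.9)],
    audio_src="chapter1.mp3",
    text_src="chapter1.xhtml",
)
```

Note that each `<text>` element refers to an `id` in the text document, so every highlighted word needs its own addressable element there; the SMIL file only carries the timing.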