Rotem Dan
Rotem Dan
1. I don't know why you should use whisper.cpp outside of Echogarden. Echogarden can call any `whisper.cpp` `main` binary you give it (via `whisperCpp.executablePath`) and supports a lot of the...
You mentioned you're possibly using cloud services for transcription, like OpenAI or Azure. OpenAI Whisper service is $0.006 per minute, that's $0.36 per hour. Azure is something like $0.18 per...
The lines possibly related are: ```ts if (tokenIndex === 0 && tokenObject.text === '[_BEG_]' && tokenObject.offsets.from === 0) { currentCorrectionTimeOffset = segmentObject.offsets.from / 1000 } ``` and ```ts startTime =...
Thanks a lot for the investigation. I guess the issue can be reported on the `whisper.cpp` repository, if it hasn't already. For now, I can work around the issue by...
I don't think I need or want pull requests (so far I've closed the two that I got). This has been a personal project of mine. Maybe I'd prefer to...
Although I couldn't reproduce this, I added a workaround for this issue on the new `1.6.0` release. The source diff is [here](https://github.com/echogarden-project/echogarden/commit/5608b75f99e8db706eea0e6cc9de79e20d4acdfe). New version also supports the new `large-v3-turbo` model...
If the default DTW-based alignment engine is applied entire text and audio, then it's likely to get confused by areas of text that are not included in the audio at...
I investigated a bit about acoustic fingerprints for speech and for varying rates. Seems like most fingerprinting methods are for music or sound effects, not specialized for speech, and are...
I haven't got to this yet. I have other things I'm currently working on for the next release. For example I'm working on adding machine translation support (Google Translate and...
Actually the published code does include a basic source file for search, but it's just a "stub". Doesn't contain anything. Over the past months I prioritized mostly fixing issues and...