Rotem Dan
Rotem Dan
The cause of the issue sees to be [incorrect file paths in the `onnxruntime-node` post-install manifest](https://github.com/microsoft/onnxruntime/issues/24770#issuecomment-2888175927), pointing to `win-x64` instead of Linux directories. After testing further, it turns out the...
Actually, the error was mine and it was a very significant one! I had a misconception about a `package.json` version like `^1.21.1` applying only up to bug-fix (patch) versions like...
`dtw-ra` uses eSpeak as part of the alignment process, to synthesize reference speech audio. Looking at the eSpeak voice list, I don't see language code `km` (Khmer), also no mention...
The `whisper` alignment engine doesn't rely on eSpeak and the Whisper multilingual models do support Khmer, so it should work. It's kind of slow on Echogarden `v2.x.x`, though, since it...
From my experience, `large-v3-turbo` doesn't perform very well for many tasks. `tiny`, `base` or `small` usually produce best result. Actually, in my experience, for general transcription, `small` usually produces better...
I don't know, maybe Whisper models simply aren't that good in producing accurate Khmer timestamps. If you could possibly send an example pair of an audio file and transcript, I...
Yes, Whisper is producing nonsense transcriptions for this audio, even when language is specified correctly. So, if it can't produce a decent transcription, it can't align as well. I tested...
I'm not exactly sure what you are describing: * If you are asking about speech-to-speech translation (a form of speech synthesis or "speech resynthesis"), that is not something it can...
My current workaround (tested to produce correct results with sizes greater than 4 GiB): ```ts export function concatBuffers(buffers: Buffer[]) { let totalLength = 0 for (const buffer of buffers) {...
[This call to `_copy`](https://github.com/nodejs/node/blob/ee46d2297c648dc6cc8cbc0327c453514b878294/lib/buffer.js#L256) is possibly the reason: ```js function _copyActual(source, target, targetStart, sourceStart, sourceEnd) { if (sourceEnd - sourceStart > target.byteLength - targetStart) sourceEnd = sourceStart + target.byteLength -...