Rotem Dan

Results 235 comments of Rotem Dan

I also now noticed that I didn't compile `speex-resampler` to WASM with the `-s MAXIMUM_MEMORY=4GB` flag, like I do with other WASM libraries, like `fvad`, so it may be that...

Seems like what is happening is that now the synthesized speech, produced as part of the DTW alignment process, is causing the same error while it is being downsampled to...

I've made the changes to the wave decoder and encoder to support lengths larger than 4 GiB. I can now align a 3.5 hour audio file in about 5 minutes...

I've released [`v1.3.1`](https://github.com/echogarden-project/echogarden/releases/tag/v1.3.1) with the fixes. It should work with long audio durations, say, up to like 3 to 4 hours, but 9 hour file may be a bit too...

This is the current logic used to set the DTW window duration when it's not given: ```ts if (options.dtw!.windowDuration == null) { const sourceAudioDuration = getRawAudioDuration(sourceRawAudio) if (sourceAudioDuration < 5...

I released [`1.4.0`](https://github.com/echogarden-project/echogarden/releases/tag/v1.4.0). I reworked the auto-selection of granularities and window durations. So now: * Less than 1 minute: high granularity, 1 minute window * Less than 5 minutes: medium...

This issue should be fixed now. On `v2.0.0` I also added the ability to specify a window duration using a percentage value like `dtw.windowDuration=20%`. It's computed relative to the total...

The [server documentation](https://github.com/echogarden-project/echogarden/blob/main/docs/Server.md) says: > The protocol is based on binary WebSocket messages, for both request and response objects. Messages are encoded using the [MessagePack](https://msgpack.org) encoding scheme. The MessagePack page...

This general area is what I'm primarily working on right now. Speech-to-transcript alignment attempts to find the approximate timing of words and sentences of the transcript you give it, in...

The file isn't generated. It's the main eSpeak-NG data file that contains all the voices and other information. It is loaded by the Emscripten version of eSpeak-NG. The code that...