Jakub Piotr Cłapa comments

Results: 77 comments by Jakub Piotr Cłapa

The group_by_keys function in tariterators.py

I think I've seen this error when I had one shard containing just a single sample (in two files) and I was doing infinite reshuffling. I think it randomly reselected...

Remove the dpipes replace directive

Hey, I think it would be nice to merge this to make the tool a bit easier to install for people?

possibly use MLX for MacOS users with WhisperSpeech

@signalprime Sure, I'll see what I can do :)

possibly use MLX for MacOS users with WhisperSpeech

@signalprime Btw. do you have a Discord? Maybe we could have a chat there?

cudaMallocAsync does not yet support checkPoolLiveAllocations. If you need it, please file an issue describing your use case.

The "Profiler function" warning should be 100% harmless. I am not sure where forking may be used during inference. Were you able to figure out the cause of the issue?

What is the performance of WhisperSpeech?

Manual listening tests with MOS seem to be the only reliable metric right now. Could be an interesting community project to make a leaderboard for TTS models with crowdsourced scoring.
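A rough sketch of what the scoring side of such a leaderboard could look like, assuming each crowdsourced rating is a `(model_name, score)` pair on the usual 1 to 5 MOS scale. The function name `mos_leaderboard` and the model names are hypothetical and only there for illustration; a real setup would also want rater screening and confidence intervals.

```python
from collections import defaultdict
from statistics import mean


def mos_leaderboard(ratings):
    """Rank models by mean opinion score.

    ratings: iterable of (model_name, score) pairs, scores on a 1-5 scale.
    Returns a list of (model_name, mos, n_ratings), best model first.
    """
    by_model = defaultdict(list)
    for model, score in ratings:
        if not 1 <= score <= 5:
            raise ValueError(f"score out of range for {model}: {score}")
        by_model[model].append(score)

    # MOS is just the arithmetic mean of the listener scores per model.
    board = [(model, mean(scores), len(scores)) for model, scores in by_model.items()]
    return sorted(board, key=lambda row: row[1], reverse=True)


# Hypothetical ratings for two made-up models.
example = [("model-a", 5), ("model-a", 4), ("model-b", 3), ("model-b", 3)]
for name, mos, n in mos_leaderboard(example):
    print(f"{name}: MOS {mos:.2f} from {n} ratings")
```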

Unknown error message, just FYI

I am not getting the error you are seeing with these samples. They are not spoken correctly but the model finished generating successfully. Would you mind trying to find a...

Feature Request: also provide models in torchscript.

It may be worth looking into the recent [ExecuTorch](https://pytorch.org/executorch/stable/intro-how-it-works.html) as a possible alternative. WhisperSpeech already works with `torch.compile` but I think we had some issues with `torch.export` around our implementation...

Magic number in SADelARTransformer

Hey, good question :) The `8` is `sqrt(64)`, where `64` is the default `head_width` (so by default, the whole expression is going to be `query_mult * 1`). Similar to the `3`...
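To make the arithmetic concrete, here is a small sketch. It assumes the expression has the form `query_mult * 8 / sqrt(head_width)`, which is inferred from the comment above and is not copied from the WhisperSpeech source; the function name `query_scale` is hypothetical.

```python
import math


def query_scale(query_mult: float, head_width: int = 64) -> float:
    # Assumed form of the expression: the constant 8 equals sqrt(64),
    # so it cancels against sqrt(head_width) at the default head width.
    return query_mult * 8 / math.sqrt(head_width)


print(query_scale(1.0))       # default head_width=64, scale is query_mult * 1
print(query_scale(1.0, 256))  # wider heads shrink the scale
```

Under that assumption, the constant only matters when `head_width` differs from the default of 64.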

Offline example

Hey, you need to download all the models manually and everything else should work. Here is an example that pulls the models into a Docker image: https://github.com/collabora/WhisperFusion/blob/c58a8d1446a768833811aca9dbdfb6388957fdcb/docker/scripts/setup-whisperfusion.sh#L19-L23