tttalshaul

Results 15 comments of tttalshaul

Audio length is 3:43 hours after cutting silence (using VAD). embeddings: scale0 6.6M len 8920 scale1 7.9M len 10704 scale2 9.9M len 13385 scale3 14M len 17846 scale4 20M len...

@nithinraok What tests can I make to understand the cause of this problem? I'm moving ClusteringDiarizer(cfg=cfg).to(device), and I see that device is cuda:0. I understand that this only effects vad...

tried now embeddings_per_chunk 50000, and also tried 500000. gpustat showed 0 GPU utilization after embedding, there was a small step of get_argmin_mat and then OOM.. (There wasn't enough time to...

Also, what is the supported limit of audio length with long form audio clustering? (Is is less than 4 hours?) Increasing embeddings_per_chunk will consume less memory in CPU RAM? or...

@tango4j cuda.is_available() is True, device is cuda:0 (as I've mentioned). I'm using ClusteringDiarizer, is NeuralDiarizer more recommended? or might behave elsewise regarding doing all computations on GPU and not CPU?...

@tango4j Thank you, Do you have an example of how to use Chime8 diarization inference with custom input files? jupyter notebook or other example? I'll try that..

I've managed to run chime8 diarization somehow on a custom recording I've choosed but I've got not good results.. I'm using currently the LongClusteringDiarizer from gabitza-tech fork.. it splits the...

@tango4j Have you got the 4hour audio samples? When do you think you will be available to approve the PR? It's waiting for 3 months already.. Thank you.