speaker-transcription issues

ModelError: Prediction interrupted; please retry (code: PA)

8

Unsure if this is a replicate issue not. I run the model on in a 8 minute mp3 via the api which ran fine in the expected amount of time....

gkamradt

Question about embedding model choice

1

Hey you, thank you for the package :) I'm researching around how to improve diarization errors related to overlapping speech, and I'd like to ask you about your choice of...

naripok

Not compatible with amd64 (apple silicon)

I run an M2 Macbook and attempted to run this on my computer. The docker couldn't run because it failed to find a GPU (it was looking for Nvidia), and...

jeffhaskin

Error 413 when calling Meronym docker

I get this error when I call the Meronym API (hosted on a remote server) with a file of more than 100mb. Is there an env var I could assign...

GuillaumeDeWin

What can we do to customize larger GPUs?

I am trying to use this with a large audio input (3.5 hours or so). Since the GPU it uses is fixed, replicate.com fails with: `Prediction failed for an unknown...

arnab

Question: Support for Spanish

1

What does it takes for doing this but for spanish language? If you outline the steps I need to do, I will give it a shot

Agusteando

Questions: guide to understanding `embeddings`?

I have not been able to find much information on what the `speakers.embeddings.` signifies. For example, some example output from this model: ```json { "segments": [...], "speakers": { "count": 2,...

arnab

Allow specifying whisper model

Similar to #2 - it would be great if we could specify the `whisper` model to use: `large/base` etc. Using smaller models would probably be fine for my workflow right...

arnab

speaker-transcription
speaker-transcription copied to clipboard

Metadata

ModelError: Prediction interrupted; please retry (code: PA)

Question about embedding model choice

Not compatible with amd64 (apple silicon)

Error 413 when calling Meronym docker

What can we do to customize larger GPUs?

Question: Support for Spanish

Questions: guide to understanding `embeddings`?

Allow specifying whisper model

← Metadata

Owner

Metadata

speaker-transcription speaker-transcription copied to clipboard

Metadata

← Metadata

Owner

Metadata

speaker-transcription
speaker-transcription copied to clipboard