Whisper issues

Missing: --prompt PROMPT

3

The `--prompt PROMPT` option is not implemented; it doesn't work.

Diarisation with deterministic stereo audio

I was not having much luck with diarisation using the SDK, so I moved to stereo recordings where each channel is one side of a conversation. Does anyone know how...

iamadamreed
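
One way to approach the stereo setup described above is to split the recording into two mono files before transcription, so each file holds exactly one speaker. The sketch below is an illustration only, not the SDK's method: it assumes an uncompressed PCM WAV file with two interleaved channels, and the function names `split_stereo_frames` and `split_stereo_wav` are hypothetical.

```python
import wave


def split_stereo_frames(frames: bytes, sample_width: int) -> tuple:
    """De-interleave raw stereo PCM bytes into (left, right) mono bytes."""
    frame_size = 2 * sample_width  # one left sample plus one right sample
    if len(frames) % frame_size:
        raise ValueError("data does not contain a whole number of stereo frames")
    left = bytearray(len(frames) // 2)
    right = bytearray(len(frames) // 2)
    # Copy byte k of every sample with a strided slice instead of a per-frame loop.
    for k in range(sample_width):
        left[k::sample_width] = frames[k::frame_size]
        right[k::sample_width] = frames[sample_width + k::frame_size]
    return bytes(left), bytes(right)


def split_stereo_wav(src: str, left_dst: str, right_dst: str) -> None:
    """Write each channel of a stereo WAV file to its own mono WAV file."""
    with wave.open(src, "rb") as wav:
        if wav.getnchannels() != 2:
            raise ValueError("expected a stereo (2-channel) WAV file")
        width = wav.getsampwidth()
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    left, right = split_stereo_frames(frames, width)
    for dst, data in ((left_dst, left), (right_dst, right)):
        with wave.open(dst, "wb") as out:
            out.setnchannels(1)
            out.setsampwidth(width)
            out.setframerate(rate)
            out.writeframes(data)
```

Each mono file could then be transcribed on its own and the segments labelled by channel and merged by timestamp. Whether that gives acceptable results here is untested.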

STREAM_AUDIO and multiple files

7

When I run `main.exe -m ggml-base.bin -f 1.wav 2.wav 3.wav 1.wav 2.wav`, where wav1 = 1,2,3 (I count and record it in a 16-bit WAV file), wav2 = 4,5,6, wav3 =...

checksummaster

When I try to process large files, this GUI only outputs 50% of the normal output

2

When I use the GUI to process long audio (>20 min), it only outputs 50% of the normal output; the rest of the output is duplicated. I tried the medium and large models, but...

Jack6811

Question about Timemarks

Hello, would it be possible to add an option for timemark / speaker? It would be a nice setup for interviews.

kusco123

CLI tool `--output-words` option doesn't work (or maybe isn't implemented)

I found that the CLI release has the option `--output-words`, which should normally generate a wts file, but it doesn't work. Then I tested the [whisper.cpp](https://github.com/ggerganov/whisper.cpp) release, and it worked, generated...

HaujetZhao

Could you please make the app describe all the videos in a folder, including subfolders, with one click? Thank you.

1

Could you please make the app describe all the videos in a folder, including subfolders, with one click? Thank you.

Chess888
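
Until the app has such a feature, a script can approximate the request above by walking a folder tree and building one command line per media file. This is only a sketch under stated assumptions: the extension list is a guess at what the tool can decode, the helper names are hypothetical, and the `main.exe -m ... -f ...` form is taken from the invocation quoted in the "STREAM_AUDIO and multiple files" issue rather than from any documentation.

```python
from pathlib import Path

# Assumed set of extensions; adjust to whatever the tool can actually decode.
MEDIA_EXTENSIONS = {".wav", ".mp3", ".mp4", ".mkv"}


def is_media(path: Path) -> bool:
    """Return True when the file extension looks like audio or video."""
    return path.suffix.lower() in MEDIA_EXTENSIONS


def find_media(root: Path) -> list:
    """Recursively collect media files under root, subfolders included."""
    return sorted(p for p in root.rglob("*") if p.is_file() and is_media(p))


def build_command(media: Path, exe: str = "main.exe", model: str = "ggml-base.bin") -> list:
    """Build the argument list for one file (flags assumed from the issue above)."""
    return [exe, "-m", model, "-f", str(media)]
```

Each argument list could be passed to `subprocess.run` in a loop over `find_media(Path("videos"))`, where `videos` stands in for the folder to process.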

[Request] An idea for fixing realtime transcription latency

1

It seems the whisper.cpp project has an example of realtime transcription using C++. Is it possible to port the same logic to C# using your implementation of D3D? And a...

DK013

Use key-press state to receive audio

Currently, the realtime transcript has a noticeable latency.

> In the current version there’s high latency for realtime audio capture.
> Specifically, depending on voice detection the figure is about...

HaujetZhao

Can you add support for faster-whisper models?

That would be this one: https://github.com/guillaumekln/faster-whisper

MrFutureV
