whisper.cpp issues

Add process-specific timings

3

Adds functions that use `CLOCK_PROCESS_CPUTIME_ID` instead of `CLOCK_MONOTONIC` for timings, and are therefore not affected by other processes on the system. One thing to check before we merge: since the...

abitofevrything

help wanted

How to give raw audio file to stream instead of microphone?

4

I am trying to live captioning a call.

SmartManoj

question

When using -pc output in the terminal, some Chinese characters cannot be displayed normally

10

just like https://github.com/ggerganov/whisper.cpp/issues/25, when transcribed in zh(chinese), there are still some characters missing, and the model is from ggml-large.bin in hugging-face(https://huggingface.co/datasets/ggerganov/whisper.cpp/tree/main). Maybe the large model and the large-v1 model still...

chenqianhe

enhancement

good first issue

PPC64 big-endian support

4

For the sake of completion of the POWER port, it would be nice if big endian worked. I made some progress on the primitives involved, but I could use some...

fitzsim

question

Should we use process time for benchmarks/timings?

2

Currently, the `bench` tool simply spits out the timings from `whisper_print_timings`. These timings are not process specific and as such are influenced by other processes on the system. Should we...

abitofevrything

enhancement

good first issue

Add a possibility to specify list of languages for each file

It'd be nice to be able to specify a list of languages when passing multiple files to recognize, or to pass a file:lang pairs to let each file be recognized...

frankiedrake

enhancement

good first issue

Decoding strangely slow on i7 Macbook Pro

1

Hey there! This is an awesome project, I'm trying to build a web app using this. Unfortunately running into a weird issue. When transcribing 2-3 seconds of audio, M1/M2 Macs...

cuuupid

performance

Language auto-detect "auto" flag does not work using the stream tool

Attempting to use the `--language auto` option to enable auto-detect with the multi-language model throws an error for the `stream` tool. **Command** ```sh ./stream -m ./models/ggml-base.bin --threads 8 --step 500...

bdrelling

enhancement

good first issue

add WhisperLangAutoDetect method to go binding

RobinXL

node --experimental-wasm-threads --experimental-wasm-simd ../tests/test-whisper.js issue

1

trying to run: node --experimental-wasm-threads --experimental-wasm-simd ../tests/test-whisper.js but get: failed to asynchronously prepare wasm: CompileError: WebAssembly.instantiate(): Compiling function #62 failed: i32x4.splat found empty stack @+5977

silvacarl2

whisper.cpp
whisper.cpp copied to clipboard

Metadata

Add process-specific timings

How to give raw audio file to stream instead of microphone?

When using -pc output in the terminal, some Chinese characters cannot be displayed normally

PPC64 big-endian support

Should we use process time for benchmarks/timings?

Add a possibility to specify list of languages for each file

Decoding strangely slow on i7 Macbook Pro

Language auto-detect "auto" flag does not work using the stream tool

add WhisperLangAutoDetect method to go binding

node --experimental-wasm-threads --experimental-wasm-simd ../tests/test-whisper.js issue

← Metadata

Owner

Metadata

whisper.cpp whisper.cpp copied to clipboard

Metadata

← Metadata

Owner

Metadata

whisper.cpp
whisper.cpp copied to clipboard