Georgi Gerganov

Results 136 issues of Georgi Gerganov

I just had an awesome idea: Make a web-page that: - Listens when someone speaks - Transcribes the words using [WASM Whisper](https://github.com/ggerganov/whisper.cpp/tree/master/examples/whisper.wasm) - Generates a new sentence using [WASM GPT-2](https://github.com/ggerganov/ggml/tree/master/examples/gpt-2)...

ideas

In order to be as minimalistic as possible, it would be nice to avoid having [dr_wav.h](https://github.com/ggerganov/whisper.cpp/blob/master/examples/dr_wav.h) in the project. We use it for a very small subset of functionality -...

enhancement
good first issue

There seems to be significant interest for a voice assistant application of Whisper, similar to "Ok, Google", "Hey Siri", "Alexa", etc. The existing [stream](https://github.com/ggerganov/whisper.cpp/tree/master/examples/stream) tool is not very applicable for...

ideas

ref #68, #188, #197 - Add the "max_initial_timestaamp" token logic from OpenAI - Disallow sampling timestamps that are in the past - Add fallback strategy when the timestamp token sampling...

We now have a sample script for transcribing radio livestream, such as BBC radio: ```bash $ ./examples/livestream.sh http://a.files.bbci.co.uk/media/live/manifesto/audio/simulcast/hls/nonuk/sbr_low/ak/bbc_world_service.m3u8 10 small.en [+] Transcribing stream with model 'small.en', step_s 10 (press Ctrl+C...

good first issue
ideas

ref #220 WIP IN PROGRESS

ref #129 The performance is comparable to when running `whisper.cpp` in the browser via WASM. The API is currently very rudimentary: https://github.com/ggerganov/whisper.cpp/blob/npm/bindings/javascript/emscripten.cpp I am hoping that there will be interest...

ref #190 #235 "Guided mode" allows you to specify a list of commands (i.e. strings) and the transcription will be guided to classify your command into one from the list....

It does not help. Keeping this for future references

ref #278 #133 #172 #255 #270 WIP in progress The goal of this PR is to reach OpenAI decoding parity and potentially go beyond There are several ideas for improving...