WhisperLive issues

运行服务器没反应，客户端运行后只输出：[INFO]: * recording

3

我在尝试windows系统下操作。服务器运行命令： `python run_server.py --port 9090 --backend faster_whisper` 客户端代码： ```python from whisper_live.client import TranscriptionClient client = TranscriptionClient( "localhost", 9090, lang="en", translate=False, model="small", use_vad=False, ) ``` 客户端输出日志： ```python PS D:\PY> &...

Jacoo520

Incomplete Transcriptions

7

While using the medium model on a GPU system, I get this error message: `ERROR:root:[ERROR]: Sending data to client: sent 1000 (OK); then received 1000 (OK)` This mean the client...

Nerdward

Whisper-live taking same time on CPU and GPU to transcribe an audio

4

I am using whisper-live==0.2.1 , faster-whisper==0.10.0 and Ctranslate2==4.0.0 Transcribing a 30-second audio file currently requires the same amount of time whether processed on a CPU or GPU, approximately 2 minutes....

prem1303

docker pull is very slow

4

`docker pull ghcr.io/collabora/whisperbot-base:latest` is very slow, taking about 20 min to pull with a data center network connection. Is there any way to speed this up?

finnatsea

Hallucinating conclusive remarks ("Thanks for watching!", "That's all!", etc.) with non-speech noise just above VAD threshold

7

Running the medium English model with VAD enabled, I've noticed a tendency to hallucinate phrases like "Thanks for watching!", "Thanks!", "That's all," etc.. I assume it's receiving some audio data...

kjhenner

bug

HLS Transcription Stops After A Couple Minutes

4

I set up a local HLS stream playing a long video of someone talking. Everything seems great until after exactly 2 minutes in the transcription stops completely. INFO:faster_whisper:Processing audio with...

austinm1120

help wanted

Docker compose tensorrt

2

- optimize tensorrt docker build with multi stages - docker compose setup to build and run container with any model size from command line which builds the models on runtime...

makaveli10

GPU mode consumes half of available CPUs

1

on the version 0.4.1, CPU transcription consume less cpu than GPU +CPU. with: docker run -it -p 9090:9090 ghcr.io/collabora/whisperlive-cpu:latest ![image](https://github.com/collabora/WhisperLive/assets/3034596/4e719839-2845-4045-bc85-6043a8917717) With GPU: docker run -it --gpus all -p 9090:9090 ghcr.io/collabora/whisperlive-gpu:latest...

zucher

The concept of transcribed bilingualism.

7

When recording in real time, for example, I specified the output voice to be English, and the audio played was a mixture of Japanese and English, sometimes translating Japanese into...

Ye83

Add options for handling multilingual input

2

This is an incomplete PR intended to start on addressing issues semi-related to #184. The `multilingual_input` option controls whether multiple languages should be expected in the input stream. If False...

jsichi

WhisperLive
WhisperLive copied to clipboard

Metadata

运行服务器没反应，客户端运行后只输出：[INFO]: * recording

Incomplete Transcriptions

Whisper-live taking same time on CPU and GPU to transcribe an audio

docker pull is very slow

Hallucinating conclusive remarks ("Thanks for watching!", "That's all!", etc.) with non-speech noise just above VAD threshold

HLS Transcription Stops After A Couple Minutes

Docker compose tensorrt

GPU mode consumes half of available CPUs

The concept of transcribed bilingualism.

Add options for handling multilingual input

← Metadata

Owner

Metadata

WhisperLive WhisperLive copied to clipboard

Metadata

← Metadata

Owner

Metadata

WhisperLive
WhisperLive copied to clipboard