RealtimeSTT
RealtimeSTT copied to clipboard
Handling of pre-recorded audio and float16 to float32 conversion on CPU
I am using the RealtimeSTT library to process pre-recorded PCM audio chunks. However, when feeding the audio, the system seems to expect live audio input and outputs the message "speak now", instead of processing the provided audio file. Additionally, I see a warning that the model's weights are automatically converted from float16 to float32 due to CPU usage, which might impact performance.
"[2025-03-31 10:21:51.636] [ctranslate2] [thread 2156] [warning] The compute type inferred from the saved model is float16, but the target device or backend do not support efficient float16 computation. The model weights have been automatically converted to use the float32 compute type instead. / speak now"