卡顿
我运行在mac mini 4 和jetson orin agx 64g 在做这个realtime voice 生成上。 生成的声音有点重复。然后说话声音一卡一卡的
[vibevoice_realtime_audio_2025-12-08T03-49-
vibevoice_realtime_audio_2025-12-08T03-49-30-420Z.wav
30-420Z.wav](https://github.com/user-attachments/files/24023298/vibevoice_realtime_audio_2025-12-08T03-49-30-420Z.wav)
The generated WAV file is normal, which suggests that the model output is correct. However, your hardware is likely not powerful enough to support real-time inference. When generation cannot keep up with playback, it sounds like stuttered.
谢谢哈。明白了看来还是得 用好一点的gpu
但是我用mac studio ultra m3 256g mps 推理也是一样的。 inference step 5 cfg scale 1.5
是不是得用mlx 去支持一下这个模型
mpx is already supported. You can try to add --device mpx when launching the demo.
python demo/vibevoice_realtime_demo.py --model_path microsoft/VibeVoice-Realtime-0.5B --device mpx
你看 这个是mps 是pytorch 版本的但是用mlx 是apple 自家的ai加速库这样兼容一下是不是会更快一点呢
Does it resolve the issue of stuttering? I’m not very familiar with Apple devices. If you have any technical suggestions, feel free to open a PR.
没有。感觉苹果的电脑运行还是会卡顿。 可能支持不够。算力不够? 无法流畅的做。我们在4070ti super 和 agx spark 上做都可以做到流畅。但是apple 的m3 studio ultra 上始终会卡顿。
M4 Pro works well.