
doc: A lack of documentation makes it hard to quickly get started.

Open premade-man opened this issue 2 months ago • 3 comments

This project's documentation is terrible. On macOS, app.py won't even run! The most baffling part is that such a powerful project doesn't even include a requirements.txt!

premade-man, Dec 13 '25 19:12

I resolved the issue by using the following dependencies:


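# Core libraries: base AI/ML frameworks that VoxCPM itself likely depends on
# If the project is installed as a PyPI package (pip install voxcpm), these are probably its runtime dependencies
torch>=2.0.0 # deep learning framework
torchaudio>=2.0.0
numpy>=1.20.0
einops>=0.6.0 # mentioned in search results as used by the ComfyUI node; may be a dependency of the model itself

# Libraries for model download and integration
huggingface_hub>=0.20.0 # for downloading VoxCPM models (e.g. VoxCPM1.5)
modelscope>=1.0.0 # for downloading ZipEnhancer and SenseVoice-Small

# Libraries for the Web Demo or specific features
gradio>=4.0.0 # per the Release Notes, the Web Demo may depend on Gradio
wetext>=0.1.0 # mentioned in search results; used for text normalization
faster-whisper # mentioned in search results; used for voice-prompt ASR/transcription in the Web Demo
pydantic>=2.0.0 # mentioned in search results; probably for data validation or configuration management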
spaces
funasr
voxcpm
torchcodec

Install them with pip install -r requirements.txt.

Then:

# Activate your environment
conda activate voxcpm

# Install FFmpeg into the environment (Recommended)
conda install -c conda-forge ffmpeg

# Reinstall torchcodec after ensuring PyTorch (2.9.1) is installed:
pip install --force-reinstall torchcodec

Problem resolved.
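
To confirm the environment after the steps above, a quick check along these lines may help. This is only a sketch, not something shipped with VoxCPM: `check_environment` is a hypothetical helper, and the import names are assumed from the requirements list above, so adjust them to whatever is actually installed.

```python
import importlib.util
import shutil

# Import names assumed from the requirements list above; adjust as needed.
MODULES = ["torch", "torchaudio", "torchcodec", "voxcpm", "gradio"]


def check_environment(modules=MODULES, binary="ffmpeg"):
    """Return a dict mapping each module name (and the binary) to True/False."""
    report = {}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        report[name] = importlib.util.find_spec(name) is not None
    # shutil.which returns None when the executable is not on PATH.
    report[binary] = shutil.which(binary) is not None
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

Anything reported as MISSING points at the package or FFmpeg step that still needs attention. The check only tells you a module can be located, not that it loads without errors.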

I hope this helps others who are facing the same problem.

premade-man, Dec 13 '25 19:12

I'm not sure how you installed this library, but all dependencies are listed in the pyproject.toml file. If you encounter any issues during installation, please submit an issue describing your situation.

a710128, Dec 15 '25 06:12

The installation docs really are garbage. To put it bluntly, there's no damn installation documentation to speak of.

huangxuegang1129-oss, Dec 24 '25 08:12