inference icon indicating copy to clipboard operation
inference copied to clipboard

Could not download qwen2-moe-instruct q4_k_m automatically

Open Tint0ri opened this issue 1 year ago • 4 comments

System Info / 系統信息

Ubuntu 20.04

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • [X] docker / docker
  • [ ] pip install / 通过 pip install 安装
  • [ ] installation from source / 从源码安装

Version info / 版本信息

0.13.3

The command used to start Xinference / 用以启动 xinference 的命令

正常启动模式

Reproduction / 复现过程

  1. 选择qwen2-moe-instruct,llama.cpp, ggufv2, 14, Q4_K_M,
  2. Launch
  3. 很短时间,UI报错,Server error: 400 - [address=0.0.0.0:40875, pid=194] Model path does not exist: /data/cache/qwen2-moe-instruct-ggufv2-14b/qwen2-57b-a14b-instruct-q4_k_m.gguf
  4. 后台Log:ValueError: [address=0.0.0.0:40875, pid=194] Model path does not exist: /data/cache/qwen2-moe-instruct-ggufv2-14b/qwen2-57b-a14b-instruct-q4_k_m.gguf
  5. /data/cache/qwen2-moe-instruct-ggufv2-14b目录下只有 __valid_download_q4_k_m文件

Expected behavior / 期待表现

下载模型并用llama.cpp后端启动。 ollama可以正常下载模型并启动。

Tint0ri avatar Jul 27 '24 15:07 Tint0ri

Can you remove /data/cache/qwen2-moe-instruct-ggufv2-14b and try again?

qinxuye avatar Jul 29 '24 09:07 qinxuye

Clear cache and try again with q4_k_m, still not work. Maybe the same error related with https://github.com/xorbitsai/inference/issues/1906

Tint0ri avatar Jul 31 '24 13:07 Tint0ri

Did you choose modelscope as downloading hub?

qinxuye avatar Jul 31 '24 14:07 qinxuye

same error with modelscope downloading hub. ValueError: [address=0.0.0.0:37497, pid=170] Model path does not exist: /data/cache/qwen2-moe-instruct-ggufv2-14b/qwen2-57b-a14b-instruct-q4_k_m.gguf

already update docker image to latest version. 0.14.1

Tint0ri avatar Aug 11 '24 01:08 Tint0ri

This issue is stale because it has been open for 7 days with no activity.

github-actions[bot] avatar Feb 13 '25 19:02 github-actions[bot]