ms-swift icon indicating copy to clipboard operation
ms-swift copied to clipboard

目前支持minicpm_v_v2_5_chat的多卡推理吗?

Open rueing opened this issue 1 year ago • 3 comments

单卡推理没问题,设置tp=2、4、8多卡推理的时候,Aborted(core dumped)

rueing avatar Aug 06 '24 03:08 rueing

device_map方式应该是没问题的

tastelikefeet avatar Aug 06 '24 07:08 tastelikefeet

device_map方式应该是没问题的

import os os.environ['CUDA_VISIBLE_DEVICES'] = '6,7'

model_type = ModelType.minicpm_v_v2_5_chat lmdeploy_engine = get_lmdeploy_engine(model_type, model_id_or_path='/home/llm/MiniCPM/MiniCPM-Llama3-V-2_5', tp=2) template_type = get_default_template_type(model_type) template = get_template(template_type, lmdeploy_engine.hf_tokenizer) lmdeploy_engine.generation_config.max_new_tokens = 256 generation_info = {}

这样直接在lmdeploy_engine这一步Aborted (core dumped)了 你说的device_map方式是怎么样的使用姿势呀?

rueing avatar Aug 06 '24 09:08 rueing

已经支持vllm & vlm, 拉取一下main分支

Jintao-Huang avatar Aug 08 '24 13:08 Jintao-Huang