swift issues

Results 206 swift issues

Sort by recently updated

可以支持一下InternLM2-Math-Plus-Mixtral8x22B的微调吗

这个模型效果非常不错,数学上面接近gpt4o了

zhangfan-algo

enhancement

infer 无法跑完所有data

CUDA_VISIBLE_DEVICES=0 \ NPROC_PER_NODE=1 \ nproc_per_node=1 \ swift infer \ --ckpt_dir "output_llava/llava1d6-mistral-7b-instruct/v32-20240524-165418/checkpoint-2003" \ --custom_val_dataset_path finetune_dataset/test.json \ --repetition_penalty 1. \ --merge_lora false \ --show_dataset_sample "-1" \ 我的test.json有5万条，但是跑这个代码时只显示有1千条 2.0.5版本无法完成infer所有data，2.0.4的可以。麻烦修改一下bug

AlexJJJChen

bug

多模态模型（qwen_vl_chat）量化失败

使用以下命令对qwen_vl_chat多模态模型量化失败，请问目前支持多模态模型的量化吗？ ############################## CUDA_VISIBLE_DEVICES=6 swift export \ --model_type qwen-vl-chat \ --quant_bits 4 \ --dataset coco-en-mini \ --quant_method gptq \ --quant_output_dir output/quant ############################# 报错信息 ############################ Traceback (most recent call last): File "/miniconda/lib/python3.9/site-packages/swift/cli/export.py",...

Luccadoremi

enhancement

微调qwen后会循环输出

![image](https://github.com/modelscope/swift/assets/52395960/67da5e9c-915d-4cd9-9f18-950110309bb2) ![image](https://github.com/modelscope/swift/assets/52395960/67da5e9c-915d-4cd9-9f18-950110309bb2) ![Uploading image.png…]()

sherry085

question

多模态提前将图片处理，然后再训练LLM

**Describe the feature** 多模态提前将图片处理，然后再训练LLM

choyakawa

question

微调minicpmv2时cpu占用率超高

**Describe the bug** 使用swift sft命令指定dataset 和val_dataset 进行微调: ```bash CUDA_VISIBLE_DEVICES=0 swift sft --model_type minicpm-v-v2-chat ``` 20 个核心全部占用60%左右 ![image](https://github.com/modelscope/swift/assets/2800905/ee1f76d8-c434-4507-b41b-89c1d3bb6004) **Your hardware and system info** CUDA：12.2 系统：Ubuntu22.04 GPU：RTX4090 Torch：2.2.2 **Additional context** Add any...

strawhatboy

swift
swift copied to clipboard

Metadata

可以支持一下InternLM2-Math-Plus-Mixtral8x22B的微调吗

infer 无法跑完所有data

多模态模型（qwen_vl_chat）量化失败

微调qwen后会循环输出

多模态提前将图片处理，然后再训练LLM

微调minicpmv2时cpu占用率超高

← Metadata

Owner

Metadata

swift swift copied to clipboard

Metadata

可以支持一下InternLM2-Math-Plus-Mixtral8x22B的微调吗

infer 无法跑完所有data

多模态模型（qwen_vl_chat）量化失败

微调qwen后会循环输出

多模态提前将图片处理，然后再训练LLM

微调minicpmv2时cpu占用率超高

← Metadata

Owner

Metadata

swift
swift copied to clipboard