Jianwei Li

1 issue by Jianwei Li

To save GPU memory, I want to load the multilingual model in 4-bit mode. The code is as follows.

```python
import torch
from transformers import AutoTokenizer
from mplug_owl.modeling_mplug_owl import MplugOwlForConditionalGeneration
...
```
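For context on why 4-bit loading saves GPU memory, here is a rough back-of-the-envelope sketch of the weight footprint at different precisions. The parameter count is a hypothetical value chosen for illustration, not a figure from the issue, and the helper `weight_memory_gib` is likewise made up for this example. It counts weights only and ignores activations, the KV cache, and quantization overhead, so real usage will be higher.

```python
def weight_memory_gib(num_params: int, bits_per_param: int) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 1024**3


# Hypothetical parameter count for illustration; not taken from the issue.
NUM_PARAMS = 7_000_000_000

for label, bits in [("fp32", 32), ("fp16", 16), ("int8", 8), ("4-bit", 4)]:
    print(f"{label:>5}: {weight_memory_gib(NUM_PARAMS, bits):.2f} GiB")
```

Under these assumptions, 4-bit weights take roughly a quarter of the fp16 footprint, which is the saving the question is after.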