Flipped
Flipped
显存占用过大疑问
使用示例代码进行推理,4bit的在回答的时候单张图像最大达到了38G的显存占用,是否是正常的。无量化的模型版本直接报错了。 from lmdeploy import TurbomindEngineConfig, pipeline from lmdeploy.vl import load_image engine_config = TurbomindEngineConfig(model_format='awq') pipe = pipeline('internlm/internlm-xcomposer2d5-7b-4bit', backend_config=engine_config) image = load_image('/root/workspace/InternLM-XComposer/examples/cars1.jpg') response = pipe(('describe this image', image)) print(response.text) 
failed to import ttsfrd, use WeTextProcessing instead /opt/conda/envs/cosyvoice/lib/python3.8/site-packages/torch/_jit_internal.py:726: FutureWarning: ignore(True) has been deprecated. TorchScript will now drop the function call on compilation. Use torch.jit.unused now. {} warnings.warn( /opt/conda/envs/cosyvoice/lib/python3.8/site-packages/diffusers/models/lora.py:393: FutureWarning: `LoRACompatibleLinear`...
图像渲染失败
大佬您好,麻烦问一下,使用您这个框架在fastgpt进行知识库问答的时候发现图像渲染不出来,您遇到过吗。 