Flipped

Results 13 issues of Flipped

使用示例代码进行推理,4bit的在回答的时候单张图像最大达到了38G的显存占用,是否是正常的。无量化的模型版本直接报错了。 from lmdeploy import TurbomindEngineConfig, pipeline from lmdeploy.vl import load_image engine_config = TurbomindEngineConfig(model_format='awq') pipe = pipeline('internlm/internlm-xcomposer2d5-7b-4bit', backend_config=engine_config) image = load_image('/root/workspace/InternLM-XComposer/examples/cars1.jpg') response = pipe(('describe this image', image)) print(response.text) ![image](https://github.com/user-attachments/assets/7a79b2bb-d360-4c67-8077-130c5d74315a)

failed to import ttsfrd, use WeTextProcessing instead /opt/conda/envs/cosyvoice/lib/python3.8/site-packages/torch/_jit_internal.py:726: FutureWarning: ignore(True) has been deprecated. TorchScript will now drop the function call on compilation. Use torch.jit.unused now. {} warnings.warn( /opt/conda/envs/cosyvoice/lib/python3.8/site-packages/diffusers/models/lora.py:393: FutureWarning: `LoRACompatibleLinear`...

stale

大佬您好,麻烦问一下,使用您这个框架在fastgpt进行知识库问答的时候发现图像渲染不出来,您遇到过吗。 ![局部截取_20240829_103957](https://github.com/user-attachments/assets/db2ce153-ac2b-4ac8-868f-03bc282df6a8)