LLM-TPU
LLM-TPU copied to clipboard

Published 20 hours ago •

Reame
Issues

在跑github下载已经转好的qwen-vl-chat-combine.bmodel模型时，会提示内存不足

Open xuyang1102 opened this issue 6 months ago • 3 comments

用bmrt_test --bmodel 测试模型时发现的这个问题 IMG_20240801_222640 IMG_20240801_222701 IMG_20240801_222721 IMG_20240801_230835

Aug 01 '24 15:08 xuyang1102