Junru Shao
Junru Shao
This is weird. I cannot reproduce the issue on my M2 Max. Does it work with the prebuilt Vicuna-7b?
Sorry I couldn't follow your question. Yes, we do cross-compilation to ARM64 for iOS build, because iOS is certainly ARM64.
@NullCodex on the contrary, it works for a real iphone but doesn’t work with a simulator.
This is a great proposal and is the direction we are working towards. I'm not particularly familiar with Android, so am CC'ing some related devs @spectrometerHBH @cyx-6 @tqchen
You might want to install a more proper Vulkan driver: https://mlc.ai/mlc-llm/docs/install/software-dependencies.html#valkan-driver-validate-installation
@GTP95 how about this one: https://mlc.ai/mlc-llm/docs/install/gpu.html
Seems that you are not using the correct TVM branch. Would you mind double checking: https://mlc.ai/mlc-llm/docs/install/tvm.html#id2
Hi Ziyu, long time no see! You may find our documentation helpful here: https://mlc.ai/mlc-llm/docs/install/tvm.html#id2
Closing as there is nothing actionable further
4G VRAM is not enough for Vicuna-7b.