Siyuan Feng
Thanks for reporting this. Could you please send a PR to fix it?
Can you please review this? @buptqq
I'm suffering from the same issue with Bert models.
This depends on https://github.com/apache/tvm/pull/16848 and is waiting for the next sync.
Waiting for dependencies: https://github.com/apache/tvm/pull/16887 and https://github.com/apache/tvm/pull/16886
BTW, there is a known numerical issue on Vulkan. I will fix it in a follow-up PR.
The current version does not support it yet, and it might be hard to modify the code to add support.
I tried a bit but failed since Hexagon is not OPEN for developers. To be specific: 1. 32-bit RTOS with 4GB memory limitation. (Qualcomm can use tricks to support more...
@ningpengtao-coder Thanks for your suggestion. That's a good approach to running models on Android. However, I (as well as the team) do not have extra bandwidth to support NNAPI in...
cc @MasterJH5574 to see if we can enhance the error message
There is a WIP PR: https://github.com/mlc-ai/mlc-llm/pull/2222