Junru Shao
Junru Shao
@shashidhar771892 Hey thanks for sharing! What do you mean by central strings though? I'm happy to help but may need some extra info
Landed and please check out this doc for details: https://mlc.ai/mlc-llm/docs/tutorials/runtime/mlc_chat_config.html
This is great to see! We haven't started collected RP data, but this is really awesome news!
Closing as there's no further actionable issue, but benchmarking is indeed super interesting thing to do!
``` [18:13:12] /Users/wenkeyu1/Desktop/mlc-llm/tvm-unity/src/target/llvm/llvm_module.cc:418: Architecture mismatch: module=arm64-apple-macos host=x86_64-apple-darwin22.3.0 ``` It seems indicative of the underlying issue. The module is of ARM architecture, while the host is x86_64.
@kywen1119 You can definitely run `build.py` no matter it's an Intel or ARM macbook without problem, but I will need more details to help you with your case. Could you...
``` Target configured: metal -keys=metal,gpu -libs=iphoneos -max_function_args=31 -max_num_threads=256 -max_shared_memory_per_block=32768 -max_threads_per_block=256 -thread_warp_size=1 ``` Just wanted to double check, are you compiling for Android or iOS? The target looks very much like...
@kywen1119 The reason is that our quantization system currently assumes the ARM CPU is used on macOS: https://github.com/mlc-ai/mlc-llm/blob/8f1386fe5aeb4342e0d7287863a3b7b2a072ed13/mlc_llm/utils.py#L372 This assumption is definitely unnecessarily strong. To support x86 CPUs, we have...
Hey thanks for asking! I believe `tir.Select` is a bit different from `tir.IfThenElse` in terms of semantics. @Hzfengsy would you like to confirm? To make sure I understand this question,...