mlc-llm
Universal LLM Deployment Engine with ML Compilation
Output from running `mlc_chat_cli`:

```
(mlc-chat) 2024mgagvani@snowy:~$ mlc_chat_cli
WARNING: lavapipe is not a conformant vulkan implementation, testing use only.
Use lib /cluster/2024mgagvani/dist/lib/vicuna-v1-7b_vulkan_float16.so
Initializing the chat module...
Finish loading
You can use...
```
My environment is Windows 10 with WSL2 Ubuntu 22.04, and I followed these commands:

```
conda create -n mlc-chat
conda activate mlc-chat
conda install git git-lfs
conda install -c mlc-ai -c conda-forge mlc-chat-nightly
mkdir...
```
Windows 10 x64, 8 GB RAM, NVIDIA GeForce 940MX. After running the `mlc_chat_cli` command:

```
Use lib E:\Code\test\mlc-chat\dist\lib\vicuna-v1-7b_vulkan_float16.dll
Initializing the chat module...
[16:56:46] D:\a\utils\utils\tvm\src\runtime\vulkan\vulkan_buffer.cc:61:
---------------------------------------------------------------
An error occurred during the execution of TVM. For more...
```
# The issue

Currently, our tokenizer.cpp port only supports loading from [a single JSON file](https://github.com/mlc-ai/mlc-llm/blob/5bdcc86a632c7105ac2b874d7d255685839dd204/3rdparty/tokenizers-cpp/tokenizers.h#L81-L84), which is the [legacy format](https://huggingface.co/docs/transformers/v4.28.1/en/internal/tokenization_utils#transformers.PreTrainedTokenizerBase.save_pretrained) of the Hugging Face tokenizer that is only applicable to fast...
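For context, the issue contrasts two on-disk layouts: the single `tokenizer.json` file that the current `tokenizers.h` loader accepts, and the multi-file layout (`vocab.json`, `merges.txt`, etc.) that `save_pretrained` can also emit. A hypothetical helper, not part of the repo, sketches how the two layouts can be told apart:

```python
from pathlib import Path


def detect_tokenizer_format(model_dir: str) -> str:
    """Classify how a Hugging Face tokenizer directory is laid out.

    Hypothetical helper for illustration only:
    - 'single-json': one tokenizer.json file, the layout the current
      tokenizers.h loader supports;
    - 'multi-file': the separate vocab/merges/config files that
      save_pretrained can also produce.
    """
    d = Path(model_dir)
    if (d / "tokenizer.json").is_file():
        return "single-json"
    multi_file = (
        "vocab.json",
        "merges.txt",
        "tokenizer_config.json",
        "special_tokens_map.json",
    )
    if any((d / name).is_file() for name in multi_file):
        return "multi-file"
    return "unknown"
```

Supporting the multi-file layout on the C++ side would presumably mean probing for these filenames the same way before choosing a loading path.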
Thanks for your effort. Do you plan to add an API layer on top of this, so it could be used as a local API layer? In my scenario I'd like...
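What such a local API layer might look like, as a minimal stdlib-only sketch: an HTTP endpoint that forwards a JSON `prompt` to a `generate(prompt) -> str` callable. `generate` is a placeholder standing in for whatever text-generation entry point the chat runtime would actually expose; none of this reflects a real mlc-llm API.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def make_handler(generate):
    """Build a request handler that forwards POST bodies to `generate`.

    `generate(prompt) -> str` is an assumed placeholder for the chat
    runtime's generation entry point, used here for illustration only.
    """

    class ChatHandler(BaseHTTPRequestHandler):
        def do_POST(self):
            length = int(self.headers.get("Content-Length", 0))
            req = json.loads(self.rfile.read(length))
            reply = generate(req.get("prompt", ""))
            body = json.dumps({"text": reply}).encode()
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def log_message(self, *args):
            # Silence per-request logging for this sketch.
            pass

    return ChatHandler


def serve(generate, port: int = 8000):
    """Serve the chat wrapper on localhost (blocks forever)."""
    HTTPServer(("127.0.0.1", port), make_handler(generate)).serve_forever()
```

A client would then `POST {"prompt": "..."}` to the server and read back `{"text": "..."}`; a production layer would more likely adopt an established schema such as the OpenAI-style chat-completions format.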
Please consider adding a Dockerfile and a docker-compose file to the repository.
It seems the tuning is per device, although the M1 tuning is applied when using any GPU. How would I use `relax_integration.tune_relax` on `mod_deploy` to create other databases? I...