Zihao Ye
Zihao Ye
You can follow this tutorial: https://mlc.ai/mlc-llm/docs/tutorials/compile-models.html
Not yet, we will have tutorials on how to add new model architectures like GLM, and you are welcomed to contribute.
We had discussions on this before and part of the reason we didn't make it a submodule is to avoid duplication (people have already cloned relax and in that case,...
Hi @tobrun already merged, thanks for your suggestions.
Hi @ziyu-guo , I think we have prebuilt wheels for M1 Mac: ``` pip install --pre mlc-ai-nightly -f https://mlc.ai/wheels ``` should work for you. If you really need to install...
Thanks for suggesting the paper, we'll read it and see how can we benefit from SnapFusion. Yes, TVM Unity could be used to deploy Stable-Diffusion models, we already have a...
@spectrometerHBH @jinhongyii are writing tutorials on adding new model architectures in MLC-LLM.
You might checkout this branch of DGL: https://github.com/kira-lin/dgl/tree/tvm_integration
I don't think Featgraph has better performance against cusparse for GCN on GPU, see table IV in the paper, since DGL uses cusparse, it's normal that you don't observe any...
Sorry I just noticed that you were using `dgl.sparse._CAPI_FG_LoadModule("../build/featgraph/libfeatgraph_kernels.so")` to use `featgraph` as backend, actually the integration was abandoned because TVM do not have native sparse support and we might...