Tianqi Chen
Make sure you run npm install first to install web-llm; you will need to change the web-llm dependency to 0.2.0 instead of a file path as suggested. If you have already...
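For reference, a minimal sketch of what the dependency entry in package.json could look like after the change (assuming the npm package name @mlc-ai/web-llm; the rest of your package.json stays as-is):

```json
{
  "dependencies": {
    "@mlc-ai/web-llm": "0.2.0"
  }
}
```

After editing, rerun npm install so the registry version replaces the local file reference.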
Try a fresh clone of the repo and run the commands in https://github.com/mlc-ai/web-llm/tree/main/examples/get-started
This is a known issue that we are working with the Safari team to address: https://bugs.webkit.org/show_bug.cgi?id=266793
That likely means that you do not have a device with enough GPU RAM
Hmm, that seems to be sufficient. Maybe you can try https://webgpureport.org/ to see if the GPU is detected. It could be an issue with your browser; make sure...
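A quick way to check the same thing from the browser console is to probe the standard WebGPU entry point directly. A minimal sketch (the helper name is mine; in a real page you would pass the global `navigator`):

```javascript
// Returns a short diagnostic string about WebGPU availability.
// `nav` is normally the browser's global `navigator` object.
async function describeWebGPUSupport(nav) {
  if (!nav.gpu) {
    // The browser does not expose the WebGPU API at all.
    return "WebGPU is not exposed by this browser";
  }
  // requestAdapter() resolves to null when no suitable GPU is found.
  const adapter = await nav.gpu.requestAdapter();
  if (!adapter) {
    return "WebGPU is exposed, but no suitable GPU adapter was found";
  }
  return "WebGPU adapter available";
}
```

Usage in a page: `describeWebGPUSupport(navigator).then(console.log);` — a null adapter usually points at a driver, browser-flag, or hardware limitation rather than a web-llm bug.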
Thank you @mwyrzykowski for enabling this; this issue can be closed. There are smaller models that we use for smaller contexts (the models that end with the -1k suffix) that might...
Actually @mwyrzykowski, if it is possible to enable bigger buffers on iOS, that would be nice, since some of the 3B models can fit well on mobile and may...
Got it, thank you @mwyrzykowski for explaining. I think we can stay with 256 MB for now then; hopefully some of the smaller models should still work well.
The latest API supports token streaming via an OpenAI-style API
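The consumption side looks like the usual OpenAI-style streaming loop: iterate over the response and accumulate the per-chunk deltas. A minimal sketch (with web-llm the chunk stream would come from something like `engine.chat.completions.create({ stream: true, messages })`; engine setup is omitted here and those names should be checked against the current web-llm docs):

```javascript
// Consume an OpenAI-style streaming response: each chunk carries a
// partial delta of the form { choices: [{ delta: { content: "..." } }] }.
// `chunks` is any async iterable of such chunks; `onDelta` is an optional
// callback for incremental UI updates as tokens arrive.
async function collectStreamedText(chunks, onDelta) {
  let text = "";
  for await (const chunk of chunks) {
    const delta = chunk.choices?.[0]?.delta?.content ?? "";
    text += delta;
    if (onDelta) onDelta(delta); // e.g. append to the page token by token
  }
  return text;
}
```

The same loop works unchanged against any OpenAI-compatible streaming endpoint, which is the point of web-llm adopting that shape.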
Unfortunately WebGL's programming model is limited and cannot enable some of the optimizations needed, so it is likely infeasible. The good news is that Firefox is also working to land WebGPU...