Results: 166 comments by Charlie Ruan

OOM errors in `createBuffer()` can now be caught in WebLLM npm 0.2.36 via https://github.com/mlc-ai/web-llm/pull/402. If catching `createBuffer()` errors does not suffice, we will follow up with more error catching in tvmjs....
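
In application code, this means model-loading failures can be handled with an ordinary try/catch. A minimal sketch, assuming the current `CreateMLCEngine` entry point (older web-llm versions expose a different API) and an illustrative model ID:

```typescript
import { CreateMLCEngine } from "@mlc-ai/web-llm";

// Sketch: catch a GPU out-of-memory error raised during model loading.
// The model ID below is illustrative; pick one from the prebuilt list.
async function loadModel() {
  try {
    return await CreateMLCEngine("Llama-3-8B-Instruct-q4f32_1-MLC");
  } catch (err) {
    // An OOM in createBuffer() now rejects here instead of failing silently.
    console.error("Failed to initialize model:", err);
    throw err;
  }
}
```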

Closing this issue as completed; feel free to open new ones if problems persist!

> I can definitely take this one! I'm actually taking it for a project of mine.

@DavidGOrtega Thanks for offering to help! You are referring to item C3, right?

> My...

I think we can go with your suggestion and use `level` here. Looking forward to the change!

Hi @germain-gg! I believe the downloads are currently resumable, since the weights are broken into shards (e.g., ~105 shards for Llama-3-8B). Each shard that finishes downloading is cached....
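
For illustration, the already-downloaded shards can be inspected through the browser's Cache API, which web-llm uses for model artifacts. A minimal sketch; the cache name `"webllm/model"` and the `.bin` filter are assumptions that may vary across versions:

```typescript
// Sketch: list which weight shards are already cached. If a download is
// interrupted, cached shards are reused and only the rest are re-fetched.
async function listCachedShards(): Promise<string[]> {
  const cache = await caches.open("webllm/model"); // assumed cache name
  const requests = await cache.keys();
  return requests
    .map((req) => req.url)
    .filter((url) => url.includes(".bin")); // weight shards are .bin files
}
```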

Really appreciate the findings! Will update the doc.

This error should be addressed in npm 0.2.36. For details, please see https://github.com/mlc-ai/web-llm/pull/397.

Function calling is a WIP and will be updated soon! Apologies for the inconvenience. All other examples should be working; feel free to try them out!

Hi @mwyrzykowski, thanks for the support, and sorry for the delayed response. WebLLM after npm 0.2.47 should fall back to a `maxBufferSize` of 256MB when the 1024MB request hits the device limit: https://github.com/mlc-ai/web-llm/pull/498...
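
The fallback amounts to retrying the WebGPU device request with a smaller required limit. A minimal sketch of the idea, not web-llm's actual code; the constants and loop structure are illustrative:

```typescript
// Sketch: request a WebGPU device with a 1GB maxBufferSize, and retry
// with 256MB if the adapter rejects the larger limit.
async function requestDeviceWithFallback(): Promise<GPUDevice> {
  const adapter = await navigator.gpu.requestAdapter();
  if (!adapter) throw new Error("WebGPU is not available");
  for (const maxBufferSize of [1 << 30, 256 << 20]) {
    try {
      return await adapter.requestDevice({
        requiredLimits: { maxBufferSize },
      });
    } catch {
      // Requested limit exceeds what this adapter supports; try smaller.
    }
  }
  throw new Error("Could not acquire a WebGPU device");
}
```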