web-llm
High-performance In-browser LLM Inference Engine
Hi, for running web-llm in the browser I've tried an NVIDIA T2000 and the built-in GPU of an Intel i9. The T2000 is faster at prompt ingestion, at ~17 tokens/s...
Would you consider adding p2p download support? After all, the models are too large for web users to download; WebGPU should go hand in hand with p2p, unless a powerful enterprise can support...
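web-llm has no p2p transport today; purely as a sketch of the idea, the receiving side of a WebRTC-based shard download might look like the following. Signaling, peer discovery, chunk ordering, and integrity checks are all omitted, and the `"EOF"` sentinel is invented for this example.

```typescript
// Sketch only: receive model bytes from a peer over a WebRTC data channel
// instead of a central CDN. The sender is assumed to stream ArrayBuffer
// chunks and finish with a hypothetical "EOF" string sentinel.
async function receiveModelFromPeer(pc: RTCPeerConnection): Promise<Blob> {
  return new Promise((resolve, reject) => {
    pc.ondatachannel = (event) => {
      const channel = event.channel;
      channel.binaryType = "arraybuffer";
      const chunks: ArrayBuffer[] = [];
      channel.onmessage = (msg) => {
        if (msg.data === "EOF") {
          resolve(new Blob(chunks)); // reassembled model shard
        } else {
          chunks.push(msg.data as ArrayBuffer);
        }
      };
      channel.onerror = () => reject(new Error("data channel failed"));
    };
  });
}
```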
Great repository! Is it within your scope to implement a WebGPU-accelerated version of Whisper? Not sure if this helps, but there is a [C port of Whisper with CPU...
It would be really nice to have WebGPU support for running other transformer models, like SBERT and embedding models. For example, here's [transformer.js](https://xenova.github.io/transformers.js/). Thanks! @jinhongyii
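For reference, a sentence-embedding call through Transformers.js looks roughly like this (it runs through ONNX Runtime rather than web-llm's WebGPU kernels; the model name is just an example checkpoint):

```typescript
import { pipeline } from "@xenova/transformers";

// Feature-extraction pipeline; "Xenova/all-MiniLM-L6-v2" is one of the
// MiniLM sentence-embedding checkpoints published under that namespace.
const extractor = await pipeline(
  "feature-extraction",
  "Xenova/all-MiniLM-L6-v2"
);

// Mean-pool and normalize token embeddings into one sentence vector.
const embedding = await extractor("In-browser LLM inference", {
  pooling: "mean",
  normalize: true,
});
console.log(embedding.dims); // e.g. [1, 384]
```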
Excited to see support added for other models like WizardLM in https://github.com/mlc-ai/web-llm/pull/75. As I don't have the hardware to build this, would it be possible to run the GitHub Action...
I got the following errors on your demo page.

```
Find an error initializing the WebGPU device
Error: Cannot find adapter that matches the request
Init error, Error: Find an error...
```
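A standalone check, independent of web-llm, can narrow down where that "Cannot find adapter" failure happens (assuming the `@webgpu/types` definitions so `navigator.gpu` type-checks):

```typescript
// Diagnostic for the "Cannot find adapter" failure: first check whether the
// browser exposes WebGPU at all, then whether it hands back an adapter.
async function diagnoseWebGPU(): Promise<void> {
  if (!("gpu" in navigator)) {
    // The browser has no WebGPU at all (or it is behind a flag).
    throw new Error("WebGPU API not exposed by this browser");
  }
  const adapter = await navigator.gpu.requestAdapter();
  if (adapter === null) {
    // The state the demo reports: API present, but no usable adapter,
    // often due to driver/OS support or disabled hardware acceleration.
    throw new Error("Cannot find adapter that matches the request");
  }
  const device = await adapter.requestDevice();
  console.log("WebGPU device acquired", device);
}
```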
I have an integrated AMD GPU (512 MB dedicated memory, 11.6 GB shared) and a discrete NVIDIA GPU (6 GB dedicated memory, 11.6 GB shared). Here are the results...
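On a dual-GPU machine like this, the browser decides which adapter backs WebGPU; `powerPreference` is the standard hint for steering that choice, so benchmark numbers can differ depending on which request the page makes. A minimal sketch:

```typescript
// Request each adapter variant; which physical GPU each hint maps to is
// ultimately up to the browser and OS.
const discrete = await navigator.gpu.requestAdapter({
  powerPreference: "high-performance", // usually maps to the discrete GPU
});
const integrated = await navigator.gpu.requestAdapter({
  powerPreference: "low-power", // usually maps to the integrated GPU
});
console.log("high-performance adapter:", discrete);
console.log("low-power adapter:", integrated);
```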
failed to find library when building model

```
Traceback (most recent call last):
  File "/Users/wangxj/web-llm/build.py", line 200, in <module>
    build(mod, ARGS)
  File "/Users/wangxj/web-llm/build.py", line 174, in build
    ex.export_library(os.path.join(args.artifact_path, output_filename))
  File "/Users/wangxj/web-llm/venv/lib/python3.9/site-packages/tvm/relax/vm_build.py",...
```
Hi there! I want to share that I've been enjoying using web-llm and its demo for some time now. However, I found that the demo didn't quite meet my needs...
Maybe WebGL support is a better choice, and gpu.js could be used for this: [gpu.js](https://github.com/gpujs/gpu.js)
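For context, this is roughly what the gpu.js API looks like (a toy element-wise kernel, not an LLM workload):

```typescript
import { GPU, IKernelFunctionThis } from "gpu.js";

// gpu.js compiles a restricted JavaScript kernel into a WebGL shader,
// with a CPU fallback. Toy kernel: element-wise product of two vectors.
const gpu = new GPU();
const multiply = gpu
  .createKernel(function (this: IKernelFunctionThis, a: number[], b: number[]) {
    return a[this.thread.x] * b[this.thread.x];
  })
  .setOutput([4]);

console.log(multiply([1, 2, 3, 4], [10, 20, 30, 40])); // [10, 40, 90, 160]
```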