web-llm
High-performance In-browser LLM Inference Engine
Hello, I have an Intel and an Nvidia card, so I rebuilt the TVM bundle to include the "high-performance" change. I noticed that when the model starts to write "weird things",...
I guess it would be easy for you to run the ggml-format, [llama.cpp](https://github.com/ggerganov/llama.cpp)-compatible models. In that case, you wouldn't need the GPU and could run the models in memory. From...
Where is the source code for vicuna-7b_webgpu.wasm, please? Thank you.
Has anyone tried to combine all 163 shards into one file? If so, was there a difference in performance? Thank you.
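For anyone who wants to try this locally, here is a minimal Node.js sketch that concatenates shard files in order. The directory layout and file names are assumptions for illustration, not the actual web-llm artifact layout.

```ts
// Hypothetical sketch: concatenate weight shards (e.g. shard_0.bin, shard_1.bin, ...)
// found in a directory into a single output file. File names are assumptions.
import { createWriteStream } from "node:fs";
import { readFile, readdir } from "node:fs/promises";

async function combineShards(dir: string, outPath: string): Promise<void> {
  const out = createWriteStream(outPath);
  // Sort numerically so shard_2 comes before shard_10.
  const shards = (await readdir(dir))
    .filter((f) => f.endsWith(".bin"))
    .sort((a, b) => a.localeCompare(b, undefined, { numeric: true }));
  for (const name of shards) {
    // Append each shard's bytes in order.
    out.write(await readFile(`${dir}/${name}`));
  }
  out.end();
}

combineShards("./params", "./combined.bin").catch(console.error);
```

Whether a single large file actually helps depends on how the runtime fetches and caches the shards, so measuring both layouts side by side would be the way to answer the performance question.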
Hey folks, amazing work. However, FYI this does not work on Microsoft Edge (running on Linux Fedora 37, with an Nvidia 1080), which is a shame. Using `edge://gpu` I...
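A quick way to narrow this kind of report down is to check whether the page sees WebGPU at all. The sketch below uses the standard `navigator.gpu.requestAdapter()` call and can be pasted into the browser console (as TypeScript it assumes `@webgpu/types` is available).

```ts
// Minimal WebGPU availability check: distinguishes "WebGPU not exposed"
// from "WebGPU exposed but no adapter returned" (driver or flag issue).
async function checkWebGPU(): Promise<void> {
  if (!("gpu" in navigator)) {
    console.log("navigator.gpu is undefined: WebGPU is not enabled in this browser.");
    return;
  }
  const adapter = await navigator.gpu.requestAdapter();
  if (!adapter) {
    console.log("WebGPU is exposed, but no adapter was returned.");
    return;
  }
  console.log("WebGPU adapter found.");
}

checkWebGPU();
```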
Adding a dropdown menu to the platform will allow users to easily select the LLM they want to use along with a brief description of its features. This will improve...
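A minimal sketch of such a dropdown is below, assuming the `CreateMLCEngine` / `engine.reload` API from `@mlc-ai/web-llm` and example model ids; both the API surface and the ids may differ between versions.

```ts
// Hedged sketch of the suggested model picker: populate a <select> with model
// ids and reload the engine when the user picks a different one.
import { CreateMLCEngine, MLCEngine } from "@mlc-ai/web-llm";

// Example model ids for illustration; the real ids depend on the release.
const modelIds = ["vicuna-v1-7b-q4f32_0", "RedPajama-INCITE-Chat-3B-v1-q4f32_0"];

async function setupModelPicker(select: HTMLSelectElement): Promise<MLCEngine> {
  for (const id of modelIds) {
    const opt = document.createElement("option");
    opt.value = id;
    opt.textContent = id; // A short description of the model could go here.
    select.appendChild(opt);
  }
  const engine = await CreateMLCEngine(modelIds[0]);
  select.addEventListener("change", () => {
    // Swap models when the selection changes.
    void engine.reload(select.value);
  });
  return engine;
}
```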
Vicuna v0's vocab_size is 32001, but v1's vocab_size is 32000, so we need to update the manual schedule.
I am not 100% sure, but 97% sure :-) that running Web-LLM with 3-5 questions caused data transfer on the order of 5-6 GB. Here is the runtime environment:...
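One hedged way to tell repeated multi-GB downloads apart from weights already stored locally is the standard `navigator.storage.estimate()` API, run from the browser console before and after a session:

```ts
// Report how much the origin currently has in browser storage (caches,
// IndexedDB, etc.) versus its quota. Large, stable usage after the first run
// suggests the weights are cached rather than re-downloaded.
async function reportStorage(): Promise<void> {
  const { usage, quota } = await navigator.storage.estimate();
  const gib = (n?: number) => ((n ?? 0) / 1024 ** 3).toFixed(2);
  console.log(`Origin storage: ${gib(usage)} GiB used of ${gib(quota)} GiB quota.`);
}

reportStorage();
```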