
WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

16 wllama issues

Here's the function:

```typescript
function parseModelUrl(url: string) {
  const urlPartsRegex = /(.*)-(\d{5})-of-(\d{5})\.gguf$/;
  const matches = url.match(urlPartsRegex);
  if (!matches || matches.length !== 4) return url;
  const baseURL = matches[1];
  const paddedShardsAmount...
```
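The excerpt above is cut off. A hedged reconstruction of what the full helper plausibly does is to expand a sharded GGUF URL such as `model-00001-of-00005.gguf` into the complete list of shard URLs; the shard-expansion logic below is a sketch, not the issue's verbatim code:

```typescript
// Sketch: if the URL matches the "-XXXXX-of-YYYYY.gguf" shard pattern,
// return the URLs of all shards; otherwise return the URL unchanged.
function parseModelUrl(url: string): string | string[] {
  const urlPartsRegex = /(.*)-(\d{5})-of-(\d{5})\.gguf$/;
  const matches = url.match(urlPartsRegex);
  if (!matches || matches.length !== 4) return url; // not a sharded model
  const baseURL = matches[1];
  const totalShards = parseInt(matches[3], 10);
  // Build zero-padded shard ids: "00001", "00002", ...
  const paddedShardIds = Array.from({ length: totalShards }, (_, i) =>
    String(i + 1).padStart(5, '0'),
  );
  return paddedShardIds.map(
    (shard) => `${baseURL}-${shard}-of-${matches[3]}.gguf`,
  );
}

// parseModelUrl('https://example.com/model-00001-of-00003.gguf')
// => [...-00001-of-00003.gguf, ...-00002-of-00003.gguf, ...-00003-of-00003.gguf]
```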

Resolves #42. Resolves #43. `loadModel()` now also accepts `Blob` or `File`. TODO: add example
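Pending that TODO, a minimal sketch of how the new API might be used, assuming `loadModel()` takes an array of `Blob`/`File` shards (the asset paths and exact signature here are assumptions, not the PR's verbatim example):

```typescript
import { Wllama } from '@wllama/wllama';

// Hypothetical wasm asset paths; adjust to wherever the files are served from.
const CONFIG_PATHS = {
  'single-thread/wllama.wasm': '/esm/single-thread/wllama.wasm',
  'multi-thread/wllama.wasm': '/esm/multi-thread/wllama.wasm',
};

const wllama = new Wllama(CONFIG_PATHS);

// Load a model from a user-picked File instead of a URL.
const input = document.querySelector<HTMLInputElement>('#model-file')!;
const file = input.files![0]; // File extends Blob
await wllama.loadModel([file]);
```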

Currently a model can fail to load for a number of different reasons. However, the error raised seems to always be a general "failed to load" error. It would be...
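Until more specific errors land, one workaround (a sketch, not a wllama API) is to pre-check the model URL before loading, so that at least HTTP failures surface with their status code instead of a generic "failed to load":

```typescript
// Probe the model URL with a HEAD request before handing it to wllama.
async function checkModelUrl(url: string): Promise<void> {
  const res = await fetch(url, { method: 'HEAD' });
  if (!res.ok) {
    throw new Error(`Model URL returned HTTP ${res.status} for ${url}`);
  }
}
```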

Because I [made a typo](https://github.com/ngxson/wllama/issues/56) in the URL of a local model file, I noticed something strange: the invalid URL seems to have ended up in the `wllama_cache` anyway. I checked...
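To verify the symptom, the standard Cache Storage API can list what ended up in the cache (the cache name `wllama_cache` is taken from the report; wllama's internal storage details may differ):

```typescript
// List every request cached under 'wllama_cache'.
const cache = await caches.open('wllama_cache');
const entries = await cache.keys();
for (const req of entries) {
  console.log(req.url); // a typo'd URL showing up here confirms the bug
}
```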

Something interesting occurred while upgrading to version 1.8.0. Previously, it had been throwing an "Out of Memory" error, but that issue has now been resolved. However, a new problem has...

The "next generation of node package manager" ==> https://jsr.io/

In your readme you mention:

> Maybe doing a full RAG-in-browser example using tinyllama?

I've been looking into a way to allow users to 'chat with their documents'. A popular...
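For what it's worth, here is a minimal sketch of the retrieval step, assuming wllama exposes `createEmbedding()` and `createCompletion()` (the helper names and option fields below are illustrative assumptions, and the loaded model must support embeddings):

```typescript
import { Wllama } from '@wllama/wllama';

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

async function answerWithContext(
  wllama: Wllama,
  docs: string[],
  question: string,
): Promise<string> {
  // Embed every document chunk and the question.
  const docEmbeddings: number[][] = [];
  for (const doc of docs) {
    docEmbeddings.push(await wllama.createEmbedding(doc));
  }
  const qEmbedding = await wllama.createEmbedding(question);

  // Pick the chunk most similar to the question as context.
  let best = 0;
  for (let i = 1; i < docs.length; i++) {
    if (cosineSimilarity(docEmbeddings[i], qEmbedding) >
        cosineSimilarity(docEmbeddings[best], qEmbedding)) {
      best = i;
    }
  }

  // Prepend the retrieved chunk to the prompt and generate.
  const prompt = `Context:\n${docs[best]}\n\nQuestion: ${question}\nAnswer:`;
  return wllama.createCompletion(prompt, { nPredict: 128 });
}
```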

~~Data is currently passed as `Uint8Array`. We can do better by using Streams: https://developer.mozilla.org/en-US/docs/Web/API/Streams_API/Using_readable_streams~~ We are now using `Blob`, which already provides a `ReadableStream`.
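For context, this is why `Blob` makes the streaming approach straightforward: every `Blob` exposes `.stream()`, a standard `ReadableStream`, so data can be consumed chunk by chunk without materializing one large `Uint8Array` (the function name below is illustrative):

```typescript
// Read a Blob incrementally via its built-in ReadableStream.
async function consumeInChunks(blob: Blob): Promise<void> {
  const reader = blob.stream().getReader();
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    // value is a Uint8Array chunk; hand it off incrementally here.
    console.log(`got chunk of ${value.byteLength} bytes`);
  }
}
```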

First, thanks for putting this project together! I modified `examples/basic/index.html` to use a more capable model: `https://huggingface.co/lmstudio-ai/gemma-2b-it-GGUF/resolve/main/gemma-2b-it-q4_k_m.gguf`, which is 1.5 GB. Using [LM Studio](https://lmstudio.ai) on my laptop (with GPU acceleration disabled),...

With the introduction of heapfs, we can now do more low-level things. The idea is to load a `File`/`Blob` directly into wllama's heap without creating any intermediate buffer. This will...
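A very rough sketch of the idea, not wllama's actual internals: stream a `Blob` chunk by chunk straight into wasm memory, assuming an Emscripten-style `Module` that exports `_malloc` and `HEAPU8`.

```typescript
// Copy a Blob into the wasm heap without an intermediate ArrayBuffer.
async function copyBlobIntoHeap(Module: any, blob: Blob): Promise<number> {
  const ptr: number = Module._malloc(blob.size); // destination in wasm heap
  let offset = 0;
  const reader = blob.stream().getReader();
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    Module.HEAPU8.set(value, ptr + offset); // write chunk directly, no staging buffer
    offset += value.byteLength;
  }
  return ptr; // caller is responsible for Module._free(ptr)
}
```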