node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model's output at the generation level.
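The JSON-schema enforcement mentioned in the description can be sketched like this (based on the v3 API; method names such as `createGrammarForJsonSchema` may vary between versions, and the model path is a placeholder):

```typescript
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "path/to/model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

// A grammar built from a JSON schema constrains generation at the token level,
// so the output is forced to match the schema
const grammar = await llama.createGrammarForJsonSchema({
    type: "object",
    properties: {
        answer: {type: "string"},
        confidence: {type: "number"}
    }
});

const response = await session.prompt("What is the capital of France?", {grammar});
const parsed = grammar.parse(response); // an object matching the schema
```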
### Issue description
LlamaCpp crashes when embedding.

### Expected Behavior
The code generates a correct embedding vector.

### Actual Behavior
LlamaCpp crashed with this error:
```
zsh: segmentation fault  node server/test.js...
```
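For reference, a minimal embedding flow that would be expected to produce a vector (a sketch based on the v3 API; `createEmbeddingContext` and `getEmbeddingFor` come from the beta docs, and the model path is a placeholder):

```typescript
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    modelPath: "path/to/embedding-model.gguf" // placeholder path
});

// Embeddings use a dedicated context type, separate from chat contexts
const embeddingContext = await model.createEmbeddingContext();
const embedding = await embeddingContext.getEmbeddingFor("Hello world");

console.log(embedding.vector.length); // dimensionality of the embedding vector
```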
### Issue description
When bundling `node-llama-cpp` with webpack and TypeScript, something odd happens: webpack appears to load the module as a promise. Once that promise is resolved, everything works...
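One common workaround for native modules in this situation, sketched here under the assumption that the promise wrapping comes from webpack bundling the package itself, is to mark `node-llama-cpp` as an external so Node.js resolves it directly at runtime (illustrative only, not a confirmed fix for this issue):

```typescript
// webpack.config.ts: leave node-llama-cpp out of the bundle
import type {Configuration} from "webpack";

const config: Configuration = {
    target: "node",
    experiments: {
        outputModule: true // emit an ESM bundle
    },
    externalsType: "module",
    externals: {
        // resolved by Node.js at runtime instead of being bundled
        "node-llama-cpp": "module node-llama-cpp"
    }
};

export default config;
```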
## How to use this beta
To install the beta version of `node-llama-cpp`, run this command inside your project:
```bash
npm install node-llama-cpp@beta
```
To get started quickly, generate...
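As a minimal sketch of the beta's basic chat flow (names such as `getLlama` and `LlamaChatSession` come from the v3 beta; the model path is a placeholder):

```typescript
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "path/to/model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const answer = await session.prompt("Hi there, how are you?");
console.log(answer);
```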
Support creating these types of projects:
* Node with TypeScript using `vite-node`
* Electron app with TypeScript
* Node with plain JavaScript
### Feature Description
llama.cpp can cache prompts to a specific file via the `--prompt-cache` flag. I think that exposing this through node-llama-cpp would enable some techniques for...
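What such an option could look like, purely as a hypothetical sketch: `promptCachePath` is an invented name for illustration, not an existing node-llama-cpp parameter, and `model` is assumed to be loaded as in the earlier examples:

```typescript
// Hypothetical sketch: `promptCachePath` is an invented option mirroring
// llama.cpp's `--prompt-cache` flag. The idea is to persist the evaluated
// prompt state to a file so later runs with the same prefix skip re-evaluation.
const context = await model.createContext({
    promptCachePath: "./cache/system-prompt.bin" // invented option name
});
```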
Also, automatically set the right `contextSize` and provide other good defaults to make usage smoother.
* Support configuring the context swapping size for infinite text generation (by default, it'll...
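For comparison, an explicit `contextSize` can be passed when creating a context (a sketch based on the v3 API; the default behavior and option names may differ across versions):

```typescript
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "path/to/model.gguf"}); // placeholder path

// Explicit context size; when omitted, the library would pick a default
// based on the model's training context and available memory
const context = await model.createContext({contextSize: 4096});
```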
Once `llama.cpp`'s support for this is stable. Hopefully there will be an official API for it after https://github.com/ggerganov/llama.cpp/issues/9643 is implemented.
### Feature Description
Ability to change the LoRA adapter dynamically after loading a LLaMA model.

### The Solution
See the `llama_model_apply_lora_from_file()` function in `llama.cpp`: https://github.com/ggerganov/llama.cpp/blob/e9c13ff78114af6fc6a4f27cc8dcdda0f3d389fb/llama.h#L353C1-L359C1

### Considered Alternatives
None.

### Additional Context
_No response_
###...
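Purely as an illustration, a binding over that function might be surfaced like this; `applyLora`, `loraPath`, and `scale` are invented names and not part of the current node-llama-cpp API:

```typescript
// Hypothetical sketch: an invented wrapper over
// llama_model_apply_lora_from_file(), assuming `model` is already loaded
await model.applyLora({
    loraPath: "path/to/adapter.bin", // LoRA adapter file (invented option)
    scale: 1.0                       // blend strength of the adapter (invented option)
});
```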
### Description of change
* feat: split gguf files support
* feat: `pull` command
* feat: `stopOnAbortSignal` and `customStopTriggers` on `LlamaChat` and `LlamaChatSession`
* feat: `checkTensors` parameter on `loadModel`
* ...
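A hedged sketch of how `stopOnAbortSignal` might be used with `LlamaChatSession.prompt`, based on the v3 beta API (the model path is a placeholder):

```typescript
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "path/to/model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const abortController = new AbortController();
setTimeout(() => abortController.abort(), 5000); // abort generation after 5 seconds

// With stopOnAbortSignal, aborting ends generation gracefully and the partial
// response generated so far is returned instead of an error being thrown
const answer = await session.prompt("Write a long story", {
    signal: abortController.signal,
    stopOnAbortSignal: true
});
console.log(answer);
```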