node-llama-cpp
Run AI models locally on your machine with Node.js bindings for llama.cpp, and enforce a JSON schema on the model output at the generation level.
### Issue description
Electron crashes when `loadModel` finishes loading (beta)

### Expected Behavior
After loading a model using this code and trying to create a context, I'd expect there not...
### Feature Description
Is it possible for `node-llama-cpp` to support React Native projects? I know that this project depends on many Node standard libraries, but is it possible to make...
### Issue description
When I use node-llama-cpp to run inference, Cloud Run fails with a 503 error.

### Expected Behavior
Run inference in Cloud Run without any issues.

### Actual Behavior
I...
### Issue description
Failed to build with the latest version of CUDA. When using prebuilt binaries, it says they are incompatible.

### Expected Behavior
No errors when invoking `getLlama`

### Actual Behavior
...
### What was unclear or otherwise insufficient?
Downloading and running the .exe under this section:

causes the error on run:

### Recommended Fix
Fix the .exe file

### Additional...
Will be added once `llama.cpp`'s TTS APIs are finalized and [Orpheus](https://github.com/canopyai/Orpheus-TTS) ([issue](https://github.com/ggml-org/llama.cpp/issues/12476)) or [SparkTTS](https://github.com/SparkAudio/Spark-TTS) ([issue](https://github.com/ggml-org/llama.cpp/issues/12495)) models are supported.
### Issue description
`Qwen3-Embedding-8B-Q4_K_M.gguf` (downloaded from Hugging Face) works with `gpu: false` (CPU) but not when run on Vulkan.

### Expected Behavior
`await embedContext.getEmbeddingFor(...)` returns successfully, or throws an error that...
### Feature Description
_First of all, huge thanks for developing this addon. I try not to disturb anyone by asking for features. This is literally my very first GitHub...
### Feature Description
I can't reproduce my existing AMD hipBLAS build:

```
export ROCM_PATH=/usr/lib64/rocm
export ROCM_DEVICE_LIB_PATH=$ROCM_PATH/llvm/lib/clang/18/amdgcn/bitcode
cmake -S . -B build \
    -DGGML_HIP=ON \
    -DAMDGPU_TARGETS=gfx1030 \
    -DCMAKE_BUILD_TYPE=Release \
    -DCMAKE_C_COMPILER=hipcc \
    ...
```
### Feature Description
Currently, on the Windows platform, CUDA and Vulkan are supported. Is there a roadmap to support SYCL? I think it would improve performance, especially on Intel GPUs....