
Results 191 comments of Louis

```shell
curl -X POST 'http://127.0.0.1:3928/inferences/llamacpp/loadmodel' \
  -H 'Content-Type: application/json' \
  -d '{
    "llama_model_path": "/Users/**/Downloads/ggml-model-q4_k.gguf",
    "mmproj": "/Users/**/Downloads/mmproj-model-f16.gguf",
    "ctx_len": 2048,
    "ngl": 100,
    "cont_batching": false,
    "embedding": false,
    "system_prompt": "",
    "user_prompt": "\n### Instruction:\n",
    "ai_prompt": "\n### Response:\n"...
```

Suspected: latest Nitro cache issue causing gibberish responses. Investigating.

I think there is a problem with the downloaded model. The stats are unreal; I'm using an M2 Pro with 32 GB of RAM, but the token speed is around...

Sorry @SmokeShine, there isn't a checksum yet to validate the download.
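Until checksums ship with the models, a manual check is possible whenever the model publisher lists a hash. This is only a sketch: `verify_checksum` is a hypothetical helper, not part of Nitro, and the expected hash must come from the publisher.

```shell
# Sketch: manual SHA-256 integrity check for a downloaded model file.
# verify_checksum is a hypothetical helper, not a Nitro feature.
verify_checksum() {
  file="$1"
  expected="$2"
  # Use sha256sum where available (Linux), fall back to shasum (macOS).
  actual=$( (sha256sum "$file" 2>/dev/null || shasum -a 256 "$file") | awk '{print $1}')
  if [ "$actual" = "$expected" ]; then
    echo "checksum OK"
  else
    echo "checksum MISMATCH for $file" >&2
    return 1
  fi
}

# Usage: compare against a hash published alongside the model, e.g.
# verify_checksum ~/Downloads/ggml-model-q4_k.gguf "<published sha256>"
```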

@RookHyena, thank you for helping lead the discussion. We've corrected the recommended tag based on RAM, VRAM, and GPU acceleration (on/off). There is also an `ngl` setting to configure...
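As a sketch of how `ngl` (number of layers offloaded to the GPU) could be chosen from the acceleration setting: `build_load_payload` is a hypothetical helper, and the values 100/0 are illustrative, not official recommendations.

```shell
# Sketch: derive the ngl field of the loadmodel payload from whether
# GPU acceleration is on. Helper and values are illustrative only.
build_load_payload() {
  gpu_on="$1"    # "on" or "off"
  model_path="$2"
  if [ "$gpu_on" = "on" ]; then
    ngl=100      # offload as many layers as possible to the GPU
  else
    ngl=0        # CPU-only: offload no layers
  fi
  printf '{"llama_model_path":"%s","ctx_len":2048,"ngl":%d}' "$model_path" "$ngl"
}

# Usage: send the payload to the loadmodel endpoint, e.g.
# curl -X POST 'http://127.0.0.1:3928/inferences/llamacpp/loadmodel' \
#   -H 'Content-Type: application/json' \
#   -d "$(build_load_payload on /path/to/model.gguf)"
```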

> Hi, of course I tried it, but it's the same behavior. It's still very, very slow. I have produced a report from CPU-Z to analyse the characteristics of my...

Thank you. I think the cache was disabled recently. Cc @tikikun

We could enable caching from thread or app settings, but we should be cautious when switching between threads.

As aligned, we will add settings for enabling/disabling the Nitro extension globally. @tikikun cc @namchuai @Inchoker

Attached #2210 as a subtask.