rag-api-server icon indicating copy to clipboard operation
rag-api-server copied to clipboard

A RAG API server written in Rust following OpenAI specs

Results 15 rag-api-server issues
Sort by recently updated
recently updated
newest added

The current approach to search only the last user message for RAG content is too simplistic, especially in multi-turn conversations or in agentic apps where the agent automatically adds or...

enhancement

Pertaining [WasmEdge #3504](https://github.com/WasmEdge/WasmEdge/issues/3504) This PR aims to add search capabilities to the RAG API Server. This functionality will be enabled through the `search` feature. It will also leverage the [llamaedge-query-server](https://github.com/suryyyansh/llamaedge-query-server/)...

Are the RUSTFLAGS in [release.yml](https://github.com/LlamaEdge/rag-api-server/blob/main/.github/workflows/release.yml#L32) still required after the addition of the `.cargo` folder?

╰─# curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" ╰─# curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" 500 Internal Server Error: [qdrant] Failed to create collection: 409# It's expected to consume multiple files...

enhancement

When I run glm-4-9b-chat-Q5_K_M.gguf on the Cuda 12 machine, the API server can be started successfully. However, when I send a question, the API server will crash. The command I...

Hi, I followed steps in README but failed at the latest step. The return from the prompt is garbled text. Wish anyone can guide me to the right path. 🫠...

This looks like an incredible project that could be very useful. Have you given any thought to which license you wanted to release it under if any? I know this...

awaiting reply

I followed the instructions in readme.md. it built successfully i guess. but when i run `wasmedge rag-api-server.wasm -h`, i got the following errors: ``` [2024-05-29 18:44:18.672] [error] instantiation failed: unknown...

awaiting reply

after try step from readme ``` curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" ``` It took 590824.84 ms = nearly 1 minute for only chunking 306 lines (91KB) file on m3...

This pull request adds a [Dependabot](https://github.blog/2020-06-01-keep-all-your-packages-up-to-date-with-dependabot/) configuration file to enable automated dependency updates for the Cargo. The configuration is set to: - Check for updates daily. - Open up to...