rag-api-server
rag-api-server copied to clipboard
A RAG API server written in Rust following OpenAI specs
The current approach to search only the last user message for RAG content is too simplistic, especially in multi-turn conversations or in agentic apps where the agent automatically adds or...
Pertaining [WasmEdge #3504](https://github.com/WasmEdge/WasmEdge/issues/3504) This PR aims to add search capabilities to the RAG API Server. This functionality will be enabled through the `search` feature. It will also leverage the [llamaedge-query-server](https://github.com/suryyyansh/llamaedge-query-server/)...
Are the RUSTFLAGS in [release.yml](https://github.com/LlamaEdge/rag-api-server/blob/main/.github/workflows/release.yml#L32) still required after the addition of the `.cargo` folder?
╰─# curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" ╰─# curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" 500 Internal Server Error: [qdrant] Failed to create collection: 409# It's expected to consume multiple files...
When I run glm-4-9b-chat-Q5_K_M.gguf on the Cuda 12 machine, the API server can be started successfully. However, when I send a question, the API server will crash. The command I...
Hi, I followed steps in README but failed at the latest step. The return from the prompt is garbled text. Wish anyone can guide me to the right path. 🫠...
This looks like an incredible project that could be very useful. Have you given any thought to which license you wanted to release it under if any? I know this...
I followed the instructions in readme.md. it built successfully i guess. but when i run `wasmedge rag-api-server.wasm -h`, i got the following errors: ``` [2024-05-29 18:44:18.672] [error] instantiation failed: unknown...
after try step from readme ``` curl -X POST http://127.0.0.1:8080/v1/create/rag -F "[email protected]" ``` It took 590824.84 ms = nearly 1 minute for only chunking 306 lines (91KB) file on m3...
This pull request adds a [Dependabot](https://github.blog/2020-06-01-keep-all-your-packages-up-to-date-with-dependabot/) configuration file to enable automated dependency updates for the Cargo. The configuration is set to: - Check for updates daily. - Open up to...