katib icon indicating copy to clipboard operation
katib copied to clipboard

feat(rag): Auto-RAG

Open gaocegege opened this issue 10 months ago • 12 comments

/kind feature

Ref https://arxiv.org/pdf/2404.01037.pdf

Auto-RAG: The idea of automatically optimizing RAG systems, akin to Auto-ML’s approach in traditional machine learning, presents a significant opportunity for future exploration. Currently, selecting the optimal configuration of RAG components — e.g., chunking strategies, window sizes, and parameters within rerankers — relies on manual experimentation and intuition. An automated system could systematically explore a vast space of RAG configurations and select the very best model (Markr.AI, 2024).

RAG requires some hyperparameters e.g. chunking strategies, and window sizes for sentence window retrieval. It should be done automatically.


Love this feature? Give it a 👍 We prioritize the features with the most 👍

gaocegege avatar Apr 10 '24 08:04 gaocegege

Maybe we could add an example to showcase how to use Katib and LlamaIndex to AutoRAG.

Not sure if there is any new feature to be implemented.

gaocegege avatar Apr 11 '24 02:04 gaocegege

Related: https://github.com/Marker-Inc-Korea/AutoRAG

gaocegege avatar Apr 11 '24 02:04 gaocegege

Are you thinking of adding an example that uses the proposed tuning API for LLMs to demonstrate Auto-RAG?

tariq-hasan avatar Apr 11 '24 05:04 tariq-hasan

@tariq-hasan It should work. But I do not have the bandwidth for it. I'm simply presenting the idea for consideration at this point.

gaocegege avatar Apr 11 '24 05:04 gaocegege

Thanks for creating this @gaocegege. Are there any differences to optimize these HPs for RAG (e.g. chunking strategies and window sizes) compare to our current optimization flow with Experiment -> Suggestion -> Trials? I guess, Trials can consume prompt and produce the metrics.

andreyvelich avatar Apr 11 '24 21:04 andreyvelich

The workflow should be similar. I think. We could make a demo based on llama index to see if there is anything we miss.

gaocegege avatar Apr 12 '24 02:04 gaocegege

Hi! I'm the developer of AutoRAG. Do you still interested in implement AutoRAG or use it? Make demo for this? We are open for any kind of collaboration.

vkehfdl1 avatar Jul 09 '24 04:07 vkehfdl1

Nice to meet you @vkehfdl1! Sure, that would be great, maybe you can attend one of our upcoming AutoML and Training WG community calls to give a demo and we can discuss how we can collaborate. cc @kubeflow/wg-training-leads

andreyvelich avatar Jul 09 '24 18:07 andreyvelich

Hi @andreyvelich Nice to meet you.

First, It will be hard to attend the community call today because the timezone. It is 2:00 a.m. here so hard to attend. Maybe other community call two weeks later in 2:00 UTC can be fine, or we can book another call.

Thanks!

vkehfdl1 avatar Jul 10 '24 00:07 vkehfdl1

Sure, that sounds great! I added you to the meeting agenda on July 24th.

andreyvelich avatar Jul 10 '24 10:07 andreyvelich

Hi @vkehfdl1, just a reminder that our community call starts in 10 minutes, if you want to give AutoRAG demo.

andreyvelich avatar Jul 24 '24 13:07 andreyvelich

/area llm

andreyvelich avatar Aug 21 '24 14:08 andreyvelich