Rag opensearch usecase with Beam's MLTransform
Added Insertion and Enrichment pipeline for RAG (Retrival Augmented Generation) usecase using OpenSearch vector database #GSOC-259
Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:
- [ ] Mention the appropriate issue in your description (for example:
addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, commentfixes #<ISSUE NUMBER>instead. - [ ] Update
CHANGES.mdwith noteworthy changes. - [ ] If this contribution is large, please file an Apache Individual Contributor License Agreement.
See the Contributor Guide for more tips on how to make review process smoother.
To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md
GitHub Actions Tests Status (on master branch)
See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.
R: @damccorm @jrmccluskey
Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control. If you'd like to restart, comment assign set of reviewers
I'd recommend rebasing the PR now that the initial RAG PR is merged
@jrmccluskey - thanks for the review suggestions! I have made changes as per your feedback, please let me know if these look good.
Code looks good, probably time to focus on cleaning up the notebook
Same note on rebasing the branch, see https://stackoverflow.com/questions/7929369/how-to-rebase-local-branch-onto-remote-master/18442755#18442755
@jrmccluskey - the branch seems upto date with master This branch has no conflicts with the base branch. Am I misunderstanding something?
@itsayushpandey I think this is the problem:
Your branch doesn't contain the most recent set of changes. To fix, you can:
git remote add upstream https://github.com/apache/beam.git
git fetch upstream
git rebase upstream/main
Hi @damccorm - the trailing commits of branch should have been updated. Can you please check if this looks good to go now?