Added insertion and enrichment pipeline
WIP: Added insertion and enrichment pipeline for RAG (Retrival Augmented Generation) usecase #GSOC-259
Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:
- [ ] Mention the appropriate issue in your description (for example:
addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, commentfixes #<ISSUE NUMBER>instead. - [ ] Update
CHANGES.mdwith noteworthy changes. - [ ] If this contribution is large, please file an Apache Individual Contributor License Agreement.
See the Contributor Guide for more tips on how to make review process smoother.
To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md
GitHub Actions Tests Status (on master branch)
See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.
@damccorm @jrmccluskey
R: @damccorm @jrmccluskey
Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control
The notebook still needs some output truncated for readability and has some empty cells at the bottom that can be removed.
@jrmccluskey I have added the changes you suggested and I will truncate the outputs after incorporating opensearch changes.