data-prepper
data-prepper copied to clipboard
Track last scan time from before scan starts instead of based on last Modified of objects
Description
This change makes it so that the filtering done between scans for last modified objects, which is used to reduce lookups to the coordination store, is tracked based on the time before the listObjects call to scan for each bucket is made. This will make it impossible to miss items that are uploaded in the middle or end of a scan
Issues Resolved
Previously attempted to fix with https://github.com/opensearch-project/data-prepper/pull/4124
Original issue: https://github.com/opensearch-project/data-prepper/issues/4123
Check List
- [x] New functionality includes testing.
- [ ] New functionality has a documentation issue. Please link to it in this PR.
- [ ] New functionality has javadoc added
- [x] Commits are signed with a real name per the DCO
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license. For more information on following Developer Certificate of Origin and signing off your commits, please check here.