Feat/enrich page number to partitions
Motivation and Context (Why the change? What's the scenario?)
- Include page number information in Text Partition.
High level description (Approach, Design)
- Implement by the simple way
- Reuse ExtractedContent from GeneratedFileDetails.
- Create chunks based on ExtractedContent instead of ExtractedText, to reference the page number.
- This way can keep page number info in a simple way but can lead to unoptimized chunks' size.
Hi, Just checking in on this — any updates on the review process?
This feature would be a great addition to Kernel Memory. It's something we’re really looking forward to, as it could unlock some essential capabilities in real-world scenarios. Thanks again for your work on this!
Hi @dluc @lunmatu101, do you have any plans to merge this PR? I'd really appreciate having this functionality available.
@lunmatu101 please read the following Contributor License Agreement(CLA). If you agree with the CLA, please reply with the following information.
@microsoft-github-policy-service agree [company="{your company}"]Options:
- (default - no company specified) I have sole ownership of intellectual property rights to my Submissions and I am not making Submissions in the course of work for my employer.
@microsoft-github-policy-service agree
- (when company given) I am making Submissions in the course of work for my employer (or my employer has intellectual property rights in my Submissions by contract or applicable law). I have permission from my employer to make Submissions and enter into this Agreement on behalf of my employer. By signing below, the defined term “You” includes me and my employer.
@microsoft-github-policy-service agree company="Microsoft"Contributor License Agreement
@microsoft-github-policy-service agree
Since this update seems to be interesting to the community, I opened this PR for review from the owner.
Hi @dluc can you take a look to this PR?
Would also really appreciate if this PR would be accepted
Hi @dluc, Could you take a look at this PR? We really need this feature urgently. Thanks
Closing as part of repository maintenance - no further action planned on this issue.