ms-agent
ms-agent copied to clipboard
[WIP]feature: support more general image extraction
Change Summary
- Add examples in unittest.
- Support capturing current base url in docling htmlbackend.
- Support fecthing image using base url.
- Support normalizing base url.
- Rename the validate_url function to resolve_url.
- Slightly repackage image extraction logic.
Related issue number
Checklist
- [ ] The pull request title is a good summary of the changes - it will be used in the changelog
- [ ] Unit tests for the changes exist
- [ ] Run
pre-commit installandpre-commit run --all-filesbefore git commit, and passed lint check. - [ ] Some cases need DASHSCOPE_TOKEN_API to pass the Unit Tests, I have at least pass the Unit tests on local
- [ ] Documentation reflects the changes where applicable
- [ ] My PR is ready to review, please add a comment including the phrase "please review" to assign reviewers