Add option to skip corrupt PDFs in PDFMergerUtility with improved exception handling

SwethaMuthuvel opened this issue 6 months ago • 2 comments

What This PR Does

This pull request improves the robustness and debuggability of PDFMergerUtility by:

Adding a skipCorruptFiles flag
- Allows users to skip unreadable or corrupt PDF files during merge.
- Default behavior is unchanged: with the flag off, the merge still throws on error.
Wrapping IOException with source context
- Converts vague errors like:
```
IOException: Could not parse object stream
```
  into more useful messages like:
```
IOException: Failed to load PDF from source: /path/to/file.pdf
```
- Helps identify exactly which file failed.
Applying both changes consistently in both merge modes
- optimizedMergeDocuments(...)
- legacyMergeDocuments(...)
- A warning is logged whenever a file is skipped.
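
For reviewers, here is a minimal standalone sketch of the pattern described above: wrap the low-level IOException with the source name, and either rethrow or log and skip depending on the flag. It is not the actual diff and does not use PDFBox. The names SkipCorruptSketch, SourceLoader, and loadAll are hypothetical, and java.util.logging stands in for whatever logger the real code uses.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Logger;

/** Hypothetical sketch; none of these names come from PDFBox or from this PR. */
final class SkipCorruptSketch {
    private static final Logger LOG =
            Logger.getLogger(SkipCorruptSketch.class.getName());

    /** Stand-in for whatever actually parses one source into a document. */
    interface SourceLoader<T> {
        T load(String source) throws IOException;
    }

    /**
     * Loads every source in order. With skipCorruptFiles == false (the default
     * described above) the first failure is rethrown with the source name
     * attached; with skipCorruptFiles == true the failure is logged and the
     * source is skipped.
     */
    static <T> List<T> loadAll(List<String> sources,
                               SourceLoader<T> loader,
                               boolean skipCorruptFiles) throws IOException {
        List<T> loaded = new ArrayList<>();
        for (String source : sources) {
            try {
                loaded.add(loader.load(source));
            } catch (IOException e) {
                if (skipCorruptFiles) {
                    LOG.warning("Skipping unreadable source: " + source
                            + " (" + e.getMessage() + ")");
                    continue;
                }
                // Keep the original exception as the cause so the parser
                // detail is still available in the stack trace.
                throw new IOException("Failed to load PDF from source: " + source, e);
            }
        }
        return loaded;
    }
}
```

Keeping the original exception as the cause is a choice made for this sketch; whether the PR does the same should be checked against the diff.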

Jul 04 '25 07:07 SwethaMuthuvel