MinerU
MinerU copied to clipboard
Could you provide word boxes' boundaries in the output json files?
Does MinerU provide the box boundaries for each word in the PDF in the output JSON files? If not, could you provide it? This information should be already available in the OCR engine used by MinerU.