pdfparser
How to remove footer and sidebar when parsing text?
How can I remove the footer and sidebar when using the PDF parser to convert a PDF to text?
@nitikachoudhary16 A PDF has no fixed sidebar or footer areas, so I doubt this can be done in general. However, if your PDFs are built to a specific pattern, that pattern can be used to identify the footer and sidebar.
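As a rough illustration of the pattern idea, here is a minimal sketch in Python. It assumes you already have the extracted text of each page as a plain string; it does not call any pdfparser API. The function names `normalize` and `strip_repeated_lines` and the `min_share` threshold are made up for this example. The heuristic treats any line that repeats on most pages (ignoring digits, so page numbers still match) as footer or sidebar text and drops it.

```python
import re
from collections import Counter


def normalize(line):
    """Collapse whitespace and replace digit runs so 'Page 3' and 'Page 4' compare equal."""
    return re.sub(r"\d+", "#", " ".join(line.split()))


def strip_repeated_lines(pages, min_share=0.6):
    """Drop lines whose normalized form appears on at least min_share of the pages.

    `pages` is a list of per-page text strings (hypothetical input format).
    """
    if len(pages) < 2:
        # With a single page there is nothing to compare against.
        return list(pages)

    counts = Counter()
    for page in pages:
        # Count each normalized, non-blank line at most once per page.
        counts.update({normalize(l) for l in page.splitlines() if l.strip()})

    threshold = min_share * len(pages)
    repeated = {key for key, n in counts.items() if n >= threshold}

    cleaned = []
    for page in pages:
        kept = [l for l in page.splitlines() if normalize(l) not in repeated]
        cleaned.append("\n".join(kept))
    return cleaned


pages = [
    "Intro text\nSidebar: Contents\nAcme Corp - Page 1",
    "More body text\nSidebar: Contents\nAcme Corp - Page 2",
    "Closing remarks\nSidebar: Contents\nAcme Corp - Page 3",
]
print(strip_repeated_lines(pages))
```

This only works when the footer or sidebar text really does repeat from page to page. Body lines that happen to repeat on most pages would be removed too, so the threshold may need tuning for your documents.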
I have observed that gaps appear inside some words. For example, the word "community" comes out as "comm unity".