pyresparser icon indicating copy to clipboard operation
pyresparser copied to clipboard

no parsing done for tables in the resume pdf/doc

Open annapurnarelan20 opened this issue 6 years ago • 4 comments

Hi, Have been trying to run the parser with the resumes containing data in tabular format like skills or experience in the resume is listed in a table , that information is skipped and is not parsed by the parser.

can you help in correcting the issue.

annapurnarelan20 avatar Aug 21 '19 10:08 annapurnarelan20

Textract and pdfminer find it hard to read tables. You can try something like: https://blog.chezo.uno/tabula-py-extract-table-from-pdf-into-python-dataframe-6c7acfa5f302

OmkarPathak avatar Aug 27 '19 09:08 OmkarPathak

@annapurnarelan20 can you provide a sample resume so that I can use it for testing purposes

OmkarPathak avatar Oct 03 '19 14:10 OmkarPathak

Hi, PFA a pdf resume with tabular format data. Sorry for late reply!

Thanks! Annapurna Relan

On Thu, Oct 3, 2019 at 7:40 PM Omkar Pathak [email protected] wrote:

@annapurnarelan20 https://github.com/annapurnarelan20 can you provide a sample resume so that I can use it for testing purposes

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/OmkarPathak/pyresparser/issues/5?email_source=notifications&email_token=ALH6MNLP77JSL2SGZKMUOMLQMX4ORA5CNFSM4IOFIQP2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAIKTFA#issuecomment-537962900, or mute the thread https://github.com/notifications/unsubscribe-auth/ALH6MNKD2B35GFRSP5IUAMTQMX4ORANCNFSM4IOFIQPQ .

annapurnarelan20 avatar Oct 14 '19 07:10 annapurnarelan20

Hey,

I just signed the petition "Sushant Singh Rajput: Boycott Karan Johar, YRF films, Salman Khan" and wanted to see if you could help by adding your name.

Our goal is to reach 3,000,000 signatures and we need more support. You can read more and sign the petition here:

http://chng.it/42Kn9G6mLt

Thanks! annapurna

annapurnarelan20 avatar Jun 18 '20 06:06 annapurnarelan20