[Bug] pdf_processing_1_python.ipynb fails with ModuleNotFoundError: No module named 'humanfriendly'

Open santoshborse opened this issue 2 months ago • 3 comments

Search before asking

  • [x] I searched the issues and found no similar issues.

Component

Notebooks and Recipe

What happened + What you expected to happen

Notebook - https://github.com/data-prep-kit/data-prep-kit/blob/dev/examples/pdf-processing-1/pdf_processing_1_python.ipynb

Step 3.1 (Download Data) fails with the error:

ModuleNotFoundError: No module named 'humanfriendly'

Reproduction script

Run Notebook - https://github.com/data-prep-kit/data-prep-kit/blob/dev/examples/pdf-processing-1/pdf_processing_1_python.ipynb

Anything else

No response

OS

MacOS

Python

3.10

Are you willing to submit a PR?

  • [x] Yes I am willing to submit a PR!

santoshborse avatar Oct 03 '25 14:10 santoshborse

This seems to be an issue with the humanfriendly package: it is not found even though it is installed.
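One possible cause of a package being "installed but not found" is that the notebook kernel runs a different Python interpreter than the one pip installed into. That is only an assumption here, not a confirmed diagnosis. A minimal sketch for checking it from a notebook cell, using only the standard library (the helper name `module_status` is hypothetical, not part of data-prep-kit):

```python
import importlib.util
import sys


def module_status(name: str) -> dict:
    """Report whether `name` is importable by the interpreter running this code."""
    # find_spec returns None when a top-level module cannot be located.
    spec = importlib.util.find_spec(name)
    return {
        "module": name,
        # Path of the interpreter backing this kernel; compare it with the
        # environment where the package was installed.
        "interpreter": sys.executable,
        "importable": spec is not None,
        "location": getattr(spec, "origin", None),
    }


print(module_status("humanfriendly"))
```

If `importable` is `False` and `interpreter` points at an environment other than the one where the package was installed, the kernel and pip are likely using different environments.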


santoshborse avatar Oct 03 '25 14:10 santoshborse

@santoshborse, there are multiple problems with this notebook, as you can see in PR #1259. Last week @sujee, who originally developed this notebook, asked us to re-engage, and we asked him to review that PR and make the necessary fixes. The issue you have raised can be added to that PR.

We also maintain an up-to-date version of this notebook here: https://github.com/data-prep-kit/data-prep-kit/blob/dev/recipes/DPK-sequence/pdf_processing_python.ipynb. In that version, humanfriendly is pip-installed in Step 1. If you want to use this pipeline, please use the notebook in the recipes folder.
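For anyone who wants to work around the error in the original notebook in the meantime, here is a rough sketch of installing the package into the same interpreter the kernel is running. It is not necessarily how Step 1 of the recipes notebook does it; the helper name `pip_install_command` is hypothetical, and the actual install call is left commented out because it needs network access.

```python
import subprocess
import sys


def pip_install_command(package: str) -> list[str]:
    """Build a pip command bound to the interpreter running this kernel."""
    # Using sys.executable avoids installing into a different environment
    # than the one the notebook actually imports from.
    return [sys.executable, "-m", "pip", "install", package]


cmd = pip_install_command("humanfriendly")
print(" ".join(cmd))

# Uncomment to actually run the install (requires network access):
# subprocess.check_call(cmd)
```

In a Jupyter cell, the `%pip install humanfriendly` magic serves the same purpose, since it also targets the current kernel's environment. The kernel may need a restart before the import succeeds.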

shahrokhDaijavad avatar Oct 06 '25 16:10 shahrokhDaijavad

@santoshborse @shahrokhDaijavad will work on this soon.

sujee avatar Oct 09 '25 18:10 sujee