[Bug] pdf_processing_1_python.ipynb is not working, complaining for ModuleNotFoundError: No module named 'humanfriendly'
Search before asking
- [x] I searched the issues and found no similar issues.
Component
Notebooks and Recipe
What happened + What you expected to happen
Notebook - https://github.com/data-prep-kit/data-prep-kit/blob/dev/examples/pdf-processing-1/pdf_processing_1_python.ipynb
Step 3.1 -Download Data is failing with error,
ModuleNotFoundError: No module named 'humanfriendly'
Reproduction script
Run Notebook - https://github.com/data-prep-kit/data-prep-kit/blob/dev/examples/pdf-processing-1/pdf_processing_1_python.ipynb
Anything else
No response
OS
MacOS
Python
3.10
Are you willing to submit a PR?
- [x] Yes I am willing to submit a PR!
seems like some issue with humanfriendly package, it is not found event if it is installed
@santoshborse, There are multiple problems associated with this notebook, as you can see in the PR #1259. Last week, @sujee, who originally developed this notebook, asked us to re-engage, and we asked him to review this PR and make the necessary fixes. The issue you have raised can be added to that PR. We also created an up-to-date version of this notebook that we maintain here: https://github.com/data-prep-kit/data-prep-kit/blob/dev/recipes/DPK-sequence/pdf_processing_python.ipynb, and in that, you can see that humanfriendly gets pip installed in Step 1. If you want to use the pipeline used here, please use the one in the recipes folder.
@santoshborse @shahrokhDaijavad will work on this soon.