data-prep-kit icon indicating copy to clipboard operation
data-prep-kit copied to clipboard

Upgrade Docling2parquet to the latest version

Open shahrokhDaijavad opened this issue 7 months ago • 3 comments

Search before asking

  • [x] I searched the issues and found no similar issues.

Component

transforms/docling2parquet

Feature

Currently, docling2parquet in DPK uses:

docling-core==2.21.2
docling-ibm-models==3.4.1
docling-parse==3.4.0
docling==2.25.1
filetype >=1.2.0, <2.0.0

This needs to be updated, since docling is now quite a few versions ahead of these versions.

Are you willing to submit a PR?

  • [ ] Yes I am willing to submit a PR!

@dolfim-ibm Need your help with this.

shahrokhDaijavad avatar May 06 '25 18:05 shahrokhDaijavad

@shahrokhDaijavad Sir as I was discussing in #1220 in my last comment , I think the DPK's Docling2parquet refers to docling module which is getting installed while setting up the DPK . So technically if we just upgrade the version in requirements.txt it should be referring to the latest version module of the DPK I think .(yeah but not very sure)

ShiroYasha18 avatar May 06 '25 19:05 ShiroYasha18

Yes, @ShiroYasha18, but I am asking @dolfim-ibm for his help in doing this, so we know exactly what version to use and we can test locally, before adding to the next PyPi release.

shahrokhDaijavad avatar May 06 '25 19:05 shahrokhDaijavad

Understood sir! my apologies for jumping the gun.

ShiroYasha18 avatar May 06 '25 19:05 ShiroYasha18

resolved in #1332

swith005 avatar Jun 24 '25 18:06 swith005