Deprecation message displayed if a user wants to use old pdf2parquet

Open shahrokhDaijavad opened this issue 8 months ago • 1 comments

Search before asking

  • [x] I searched the issues and found no similar issues.

Component

transforms/pdf2parquet

Feature

If a user tries to use from dpk_pdf2parquet.transform_python import Pdf2Parquet (e.g., in a notebook), we show a deprecation message and point the user to from dpk_docling2parquet.transform_python import docling2Parquet instead.

Questions:

  1. Should we keep the skeleton of the pdf2parquet transform, with this file in it implementing the deprecation: https://github.com/data-prep-kit/data-prep-kit/blob/dev/transforms/language/pdf2parquet/dpk_pdf2parquet/transform_python.py?
  2. Should we follow this link (https://typing.python.org/en/latest/spec/directives.html#deprecated) to implement the deprecation?
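
To make question 1 concrete, here is a minimal sketch of what the skeleton transform_python.py could contain. It is only an illustration for discussion: the class body is a hypothetical placeholder, not the existing implementation, and the message wording is an assumption. It uses only the standard-library warnings module.

```python
import warnings

# Assumed wording; the import path is the one named in this issue.
_DEPRECATION_MESSAGE = (
    "dpk_pdf2parquet is deprecated. Use "
    "'from dpk_docling2parquet.transform_python import docling2Parquet' instead."
)


class Pdf2Parquet:
    """Hypothetical placeholder kept only to point users at the replacement."""

    def __init__(self, *args, **kwargs):
        # stacklevel=2 attributes the warning to the caller's line
        # (for example the notebook cell), not to this module.
        warnings.warn(_DEPRECATION_MESSAGE, DeprecationWarning, stacklevel=2)
```

With Python's default warning filters, a DeprecationWarning is shown when it is attributed to code running in __main__, which should cover a notebook cell or a script that instantiates the class directly. For question 2, the deprecated decorator described at the link (available as warnings.deprecated in Python 3.13+ or typing_extensions.deprecated on earlier versions) would give a similar runtime warning and also let type checkers flag the usage.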

Are you willing to submit a PR?

  • [ ] Yes I am willing to submit a PR!

shahrokhDaijavad, Apr 03 '25 23:04

cc: @swith005 I am just creating this issue for discussion before we implement anything.

shahrokhDaijavad, Apr 03 '25 23:04

pdf2parquet is no longer referenced in the library, so there is nothing to deprecate in the latest version (the change would have to be made on an older version).

The solution should be to update to the latest version and use docling2parquet.

swith005, Jun 24 '25 18:06