flink icon indicating copy to clipboard operation
flink copied to clipboard

[FLINK-35825][hive] HiveTableSource supports report statistics for text file

Open reswqa opened this issue 1 year ago • 2 comments

What is the purpose of the change

HiveTableSource supports report statistics for text file

(NB. Most of the authorship belongs to @WencongLiu)

Brief change log

  • Introduce a TextFormatStatisticsReportUtil to estimate statistics for text fie.

Verifying this change

covered by ut.

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): no
  • The public API, i.e., is any changed class annotated with @Public(Evolving): no
  • The serializers: no
  • The runtime per-record code paths (performance sensitive): no
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: no
  • The S3 file system connector: no

Documentation

  • Does this pull request introduce a new feature? no

reswqa avatar Jul 12 '24 05:07 reswqa

CI report:

  • 1bf0b9c33e0baeb5cf33526c4d8f13cf36e8b2fa Azure: SUCCESS
Bot commands The @flinkbot bot supports the following commands:
  • @flinkbot run azure re-run the last Azure build

flinkbot avatar Jul 12 '24 05:07 flinkbot

@flinkbot run azure

reswqa avatar Oct 11 '24 01:10 reswqa

Thanks @xuyangzhong for the review, updated.

reswqa avatar Nov 21 '24 08:11 reswqa

Thanks for the review, rebased the latest master. I will merge this after CI green.

reswqa avatar Nov 25 '24 06:11 reswqa