gobblin
gobblin copied to clipboard
[GOBBLIN-656] Add filtering of table format in HiveDatasetFinder
Dear Gobblin maintainers,
JIRA
- [x] My PR addresses the following Gobblin JIRA issues and references them in the PR title. For example, "[GOBBLIN-656] My Gobblin PR"
- https://issues.apache.org/jira/browse/GOBBLIN-656
Description
- [x] In some of pipelines using
HiveDatasetFinderwe are asked to have functionalities to reject specific types of tables. The rejection criteria could include the table format which is reflected from SerDe library it uses.
Tests
- [x] Unit Tested by the added case.
Commits
- [x] My commits all reference JIRA issues in their subject lines, and I have squashed multiple commits if they address the same issue. In addition, my commits follow the guidelines from "How to write a good git commit message":
- Subject is separated from body by a blank line
- Subject is limited to 50 characters
- Subject does not end with a period
- Subject uses the imperative mood ("add", not "adding")
- Body wraps at 72 characters
- Body explains "what" and "why", not "how"
@htran1 Can you help review? Thanks.