hive icon indicating copy to clipboard operation
hive copied to clipboard

HIVE-28490: Stop removing retainable DPP sources

Open ngsg opened this issue 1 year ago • 3 comments

What changes were proposed in this pull request?

This patch changes the implementation of SharedWorkOptimizer#findAscendantWorkOperators. For a given operator, original findAscendantWorkOperators() returns a set of operators that belongs to ancestor vertices of the vertex that given operator belongs to. Therefore, some operators that are irrelevant to the given operator also belong to the resultant set, which leads to wrong DPP source removal. The new implementation returns a set of ancestor operators of the given operator. Please check out the slides attached in the JIRA page (HIVE-28490) for further explanations.

Why are the changes needed?

In order to prevent the removal of retainable DPP sources that are still useful in runtime.

Does this PR introduce any user-facing change?

No

Is the change a dependency upgrade?

No

How was this patch tested?

We tested this patch by running all TPC-DS queries in cluster. This patch contains a qfile test that reproduces the wrong DPP source removal.

ngsg avatar Aug 30 '24 05:08 ngsg

tips: HIVE-24890 => HIVE-28490

BsoBird avatar Aug 30 '24 09:08 BsoBird

@BsoBird , thanks for pointing out the typo. I fixed it.

ngsg avatar Sep 02 '24 00:09 ngsg

thanks for the fix @ngsg! sorry for the late reply. I've added a few comments

deniskuzZ avatar Oct 28 '24 15:10 deniskuzZ

@ramesh0201, @kasakrisz, could you please take a look at this PR as a second pair of eyes? thanks!

deniskuzZ avatar Oct 29 '24 08:10 deniskuzZ

ok, since there were no other reviews, I am merging this PR

deniskuzZ avatar Nov 19 '24 11:11 deniskuzZ