jdk icon indicating copy to clipboard operation
jdk copied to clipboard

8327381: Refactor type-improving transformations in BoolNode::Ideal to BoolNode::Value

Open tabjy opened this issue 11 months ago • 18 comments

This PR resolves JDK-8327381

Currently the transformations for expressions with patterns ((x & m) u<= m) or ((m & x) u<= m) to true is in BoolNode::Ideal function with a new constant node of value 1 created. However, this is technically a type-improving (reduction in range) transformation that's better suited in BoolNode::Value function.

New unit test test/hotspot/jtreg/compiler/c2/TestBoolNodeGvn.java asserting on IR nodes and correctness of this transformation is added and passing.


Progress

  • [x] Change must not contain extraneous whitespace
  • [x] Commit message must refer to an issue
  • [ ] Change must be properly reviewed (2 reviews required, with at least 2 Reviewers)

Issue

  • JDK-8327381: Refactor type-improving transformations in BoolNode::Ideal to BoolNode::Value (Enhancement - P5)

Reviewers

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/18198/head:pull/18198
$ git checkout pull/18198

Update a local copy of the PR:
$ git checkout pull/18198
$ git pull https://git.openjdk.org/jdk.git pull/18198/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 18198

View PR using the GUI difftool:
$ git pr show -t 18198

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/18198.diff

Webrev

Link to Webrev Comment

tabjy avatar Mar 11 '24 14:03 tabjy

:wave: Welcome back kxu! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

bridgekeeper[bot] avatar Mar 11 '24 16:03 bridgekeeper[bot]

@tabjy The following label will be automatically applied to this pull request:

  • hotspot-compiler

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.

openjdk[bot] avatar Mar 11 '24 16:03 openjdk[bot]

Thanks @jaskarth and @eme64 for the review. I've pushed a new commit to address the following:

  • Updated license header year to 2024
  • Explicit nullptr comparison
  • Node* var for pointer types
  • Test moved to c2.irTests, added @bug and @summary tags

tabjy avatar Mar 12 '24 18:03 tabjy

Oops. Package name updated. Sorry for such a rookie mistake!

tabjy avatar Mar 13 '24 02:03 tabjy

@tabjy This change now passes all automated pre-integration checks.

ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details.

After integration, the commit message for the final commit will be:

8327381: Refactor type-improving transformations in BoolNode::Ideal to BoolNode::Value

Reviewed-by: chagedorn, thartmann, jkarthikeyan, epeter

You can use pull request commands such as /summary, /contributor and /issue to adjust it as needed.

At the time when this comment was updated there had been 28 new commits pushed to the master branch:

  • 32c975098521e830ce706b67e7232a007c0846c7: 8339160: [BACKOUT] JDK-8338440 Parallel: Improve fragmentation mitigation in Full GC
  • 9d183bd02763ee4ff5aa8388e039d8b5a6964328: 8339149: jfr_flush_event_writer - return value type mismatch
  • 2150521650d6b730cfe9d3ecb91d589c96862475: 8322036: Improve help output from the javadoc tool
  • 1ff9ac7233d51a58fd54a92d2c45761478574cc7: 8338731: MemoryLayout::offsetHandle can return a negative offset
  • 2e174c6367c7755d8541f9669f7f10ed89878f58: 8338445: jdk.internal.loader.URLClassPath may leak JarFile instance when dealing with unexpected Class-Path entry in manifest
  • 8e88da05b9966892e117b779d59a2e311a557a8d: 8338041: Keyboard Navigation of JTable, Ctrl Shift RIGHT/LEFT doesn't follow native action in GTK L&F
  • 449ca2c3c1cb5d056a2d259be2ff069ba2a36b80: 8337832: Optimize datetime toString
  • b1b4cd429a4135840966975dd0c068fe428e2ee6: 8332158: [XWayland] test/jdk/java/awt/Mouse/EnterExitEvents/ResizingFrameTest.java
  • 284c3cde5e1b7115fb17c51f3ed17c1be95845bc: 8336299: Improve GCLocker stall diagnostics
  • 2e96f159aaee782a627902c04dbd51daa3406ab5: 8338489: Typo in MemorySegment doc
  • ... and 18 more: https://git.openjdk.org/jdk/compare/a15af6998e8f7adac2ded94ef5a47e22ddb53452...master

As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the /integrate command for further details.

As you do not have Committer status in this project an existing Committer must agree to sponsor your change. Possible candidates are the reviewers of this PR (@jaskarth, @eme64, @chhagedorn, @TobiHartmann) but any other Committer may sponsor as well.

➡️ To flag this PR as ready for integration with the above commit message, type /integrate in a new comment. (Afterwards, your sponsor types /sponsor in a new comment to perform the integration).

openjdk[bot] avatar Mar 13 '24 20:03 openjdk[bot]

(x & m) u< m + 1 is false for m = -1, right?

Edit: Yep, filed JDK-8328315.

merykitty avatar Mar 18 '24 04:03 merykitty

Ah, I had assumed the transformation was valid beforehand, since it had existed for a while :sweat_smile: The issue only impacts -1, right? Since the comparison should succeed for both m >=0 and m < -1. I think it would be good to address it in this patch, as it's refactoring the existing code.

The original patch seems to primarily test with array.length as the m value, so the value set was nonnegative. I think we can limit the ((x & m) u< m + 1) transform to cases where m is known to be nonnegative and maintain the intent behind the transform. Something like:

-} else if (_test._test == BoolTest::lt && cmp2->Opcode() == Op_AddI && cmp2->in(2)->find_int_con(0) == 1) {
+} else if (_test._test == BoolTest::lt && cmp2->Opcode() == Op_AddI && cmp2->in(2)->find_int_con(0) == 1 && phase->type(cmp2->in(1))->is_int()->_lo >= 0) {

With the IR test being modified accordingly. It'd also be good to write an IR test method that verifies that the transform doesn't take place if m doesn't succeed the _lo >= 0 test.

jaskarth avatar Mar 19 '24 05:03 jaskarth

@jaskarth Optimally, (x & m) u< m + 1 can be transformed into m != -1 but I think limiting it to non-negative m seems to be a reasonable approach.

merykitty avatar Mar 19 '24 13:03 merykitty

@tabjy I am re-running testing, then will re-review. /reviewers 2 reviewer

eme64 avatar Mar 19 '24 16:03 eme64

@eme64 The total number of required reviews for this PR (including the jcheck configuration and the last /reviewers command) is now set to 2 (with at least 2 Reviewers).

openjdk[bot] avatar Mar 19 '24 16:03 openjdk[bot]

Sorry for not following up closely. The PR has been updated per review suggested.

tabjy avatar Mar 25 '24 16:03 tabjy

Hi @eme64, I'm wondering if you could kindly take a look at the updated commits. Thanks a lot!

tabjy avatar Apr 03 '24 14:04 tabjy

(x & m) u< m + 1 is false for m = -1, right?

This bug should be handled separately. I'll do that.

martinuy avatar Apr 30 '24 21:04 martinuy

/reviewers 2 reviewer

@chhagedorn told me he would also like to review this. I will also run testing again.

eme64 avatar Jun 14 '24 07:06 eme64

@eme64 The total number of required reviews for this PR (including the jcheck configuration and the last /reviewers command) is now set to 2 (with at least 2 Reviewers).

openjdk[bot] avatar Jun 14 '24 07:06 openjdk[bot]

PR updated according to requests. It looked like GHA previously had a problem with serviceability tests across the repo. Let's wait until the new round of testing is complete.

tabjy avatar Jun 18 '24 14:06 tabjy

I've run some testing again and the newly added test failed on all platforms in tier2:

compiler/c2/gvn/TestBoolNodeGVN.java

Additionally required flags:

-XX:-TieredCompilation

Maybe you need to increase the warm-up.

Output:

One or more @IR rules failed:

Failed IR Rules (1) of Methods (1)
----------------------------------
1) Method "public static boolean compiler.c2.gvn.TestBoolNodeGVN.testShouldHaveCpmU(int,int)" - [Failed IR rules: 1]:
   * @IR rule 1: "@compiler.lib.ir_framework.IR(phase={AFTER_PARSING}, applyIfPlatformAnd={}, applyIfCPUFeatureOr={}, counts={"_#CMP_U#_", "4"}, applyIfPlatform={}, failOn={}, applyIfPlatformOr={"x64", "true", "aarch64", "true", "riscv64", "true"}, applyIfOr={}, applyIfCPUFeatureAnd={}, applyIf={}, applyIfCPUFeature={}, applyIfAnd={}, applyIfNot={})"
     > Phase "After Parsing":
       - counts: Graph contains wrong number of nodes:
         * Constraint 1: "(\\d+(\\s){2}(CmpU.*)+(\\s){2}===.*)"
           - Failed comparison: [found] 2 = 4 [given]
             - Matched nodes (2):
               * 31  CmpU  === _ 23 27  [[ 32 ]]  !jvms: TestBoolNodeGVN::testShouldHaveCpmU @ bci:9 (line 66)
               * 60  CmpU  === _ 23 57  [[ 61 ]]  !jvms: TestBoolNodeGVN::testShouldHaveCpmU @ bci:44 (line 68)

chhagedorn avatar Jun 26 '24 06:06 chhagedorn

@tabjy any update on this? Let us know if you need help with reproducing the issue that Christian reported. Thanks!

TobiHartmann avatar Jul 08 '24 13:07 TobiHartmann

@TobiHartmann Thanks for the reminder.

I had no problem reproducing this issue, but what ultimately caused it is more involved and has something to do with a possible bug in the IR test framework itself. Let me explain:

First, enabling/disabling TieredCompilation does make a difference in produced IR. This has nothing to do # of executions during warm-up but rather the discrepancy in profiling: with TieredCompilation enlabled, profiling is done at C1 level, and there are 2 additional Cmp_U3 nodes and uncommon traps.

Second, the IR framework seems to be incorrectly matching Cmp_U3 as Cmp_U nodes. Afaik, the framework uses regex to match strings and determine the type of a node [0]. Because CmpU is also a prefix of CmpU3, there could be a problem with pattern matching. However, I've not confirmed this. If this is the case, it might deserve a separate issue.

Third, I made incorrect assumption on the number of Cmp_U nodes to begin with:

        return !(Integer.compareUnsigned((x & m), m - 1) > 0) |
                !(Integer.compareUnsigned((m & x), m - 1) > 0) |
                Integer.compareUnsigned((x & m), m + 2) < 0 |
                Integer.compareUnsigned((m & x), m + 2) < 0;

I was under the assumption that the 4 cases would produce 4 Cmp_U nodes; however, the first and the second are semantically equivelent and optmized into 1. The same goes for the latter two. With the additional 2 miscounted Cmp_U3, they coincidently add to 4 and passed test with TieredCompilation enabled. (But not with -XX:-TieredCompilation).

tabjy avatar Jul 08 '24 16:07 tabjy

I pushed a commit to spread test cases compounded with & and | into subcases to avoid optimizing out semantically equivalent ones and to make test clearer. To make test passing even with CmpU3 miscounted as CmpU, I specified counts = {IRNode.CMP_U, ">=1"} instead of strictly 1. I hope this is acceptable.


A case illustrating CmpU3 matched as CmpU:

    @Test
    @Arguments(values = {Argument.DEFAULT, Argument.DEFAULT})
    @IR(counts = {IRNode.CMP_U, "1"},  // <-- expecting strictly 1
        phase = CompilePhase.AFTER_PARSING,
        applyIfPlatformOr = {"x64", "true", "aarch64", "true", "riscv64", "true"})
    public static boolean testShouldHaveCpmUCase1(int x, int m) {
        return !(Integer.compareUnsigned((x & m), m - 1) > 0);
    }
1) Method "public static boolean compiler.c2.gvn.TestBoolNodeGVN.testShouldHaveCpmUCase1(int,int)" - [Failed IR rules: 1]:
   * @IR rule 1: "@compiler.lib.ir_framework.IR(phase={AFTER_PARSING}, applyIfPlatformAnd={}, applyIfCPUFeatureOr={}, counts={"_#CMP_U#_", "1"}, applyIfPlatform={}, failOn={}, applyIfPlatformOr={"x64", "true", "aarch64", "true", "riscv64", "true"}, applyIfOr={}, applyIfCPUFeatureAnd={}, applyIf={}, applyIfCPUFeature={}, applyIfAnd={}, applyIfNot={})"
     > Phase "After Parsing":
       - counts: Graph contains wrong number of nodes:
         * Constraint 1: "(\d+(\s){2}(CmpU.*)+(\s){2}===.*)"
           - Failed comparison: [found] 2 = 1 [given]
             - Matched nodes (2):
mismatched-->  * 28  CmpU3  === _ 23 27  [[ 39 ]]  !jvms: TestBoolNodeGVN::testShouldHaveCpmUCase1 @ bci:6 (line 93)
               * 31  CmpU  === _ 23 27  [[ 32 ]]  !jvms: TestBoolNodeGVN::testShouldHaveCpmUCase1 @ bci:9 (line 93)

tabjy avatar Jul 08 '24 16:07 tabjy

Thanks @tabjy! @chhagedorn, our IR Framework expert, is currently on vacation but will be back later this week.

TobiHartmann avatar Jul 22 '24 06:07 TobiHartmann

@chhagedorn should we change the regex to have a space at the end, so that we do not do this kind of prefix-matching?

eme64 avatar Jul 22 '24 06:07 eme64

@chhagedorn should we change the regex to have a space at the end, so that we do not do this kind of prefix-matching?

I think that would be a better solution to change the regex. Then we can change the test in such a way that it does an exact counting. Maybe we should move through all the IRNode regexes at some point and make sure that all placeholder strings match a unique node.

We already do that, for example, to match CountedLoop and not CountedLoopEnd by additionally matching the word boundary of the node name with \b: https://github.com/openjdk/jdk/blob/05d88de05e9b7814ecd5517aacd17f0feafdff3c/test/hotspot/jtreg/compiler/lib/ir_framework/IRNode.java#L495-L499

We could do something similar for CmpU (and we should then probably also apply it for CmpUL):

    public static final String CMP_U = PREFIX + "CMP_U" + POSTFIX;
    static {
        String regex = START + "CmpU\\b" + MID + END;
        beforeMatching(CMP_U, regex);
    }

chhagedorn avatar Jul 24 '24 10:07 chhagedorn

@tabjy Are you planning to keep working on this? I talked with @chhagedorn and we would like you to change the IR rule to something like @IR(counts = {IRNode.CMP_U + "\b", "1"}

In a later and separate RFE, we can then adjust the regex for all nodes, in a bulk update.

eme64 avatar Aug 15 '24 06:08 eme64

In a later and separate RFE, we can then adjust the regex for all nodes, in a bulk update.

Good idea, I filed JDK-8338809 to follow up on this after this PR gets integrated. But for now, I suggest to use an explicit regex as suggested above with @IR(counts = {IRNode.CMP_U + "\b", "1"}. We can then follow up and change this IR test once JDK-8338809 is tackled.

chhagedorn avatar Aug 22 '24 05:08 chhagedorn

I sincerely apologize for not following up on this timely. This won't happen again.

[...] change the IR rule to something like @IR(counts = {IRNode.CMP_U + "\b", "1"}

First, I assume you mean ... + "\\b" (double escaped for regex).

Unfortunately this does not work. IRNode.CMP_U has a postfix #_, making the expression _#CMP_U#_\\b.

https://github.com/openjdk/jdk/blob/a15af6998e8f7adac2ded94ef5a47e22ddb53452/test/hotspot/jtreg/compiler/lib/ir_framework/IRNode.java#L440-L443

Even if we explicitly make it @IR(counts = {"_#CMP_U\\b#_", "1"}, it is still the one without the \b registered by beforeMatchingNameRegex(irNodePlaceholder, irNodeRegex), leading to unexpected node type during assertion:

Violations (4)
--------------
 - IR Node "_#CMP_U\b#_" defined in class IRNode has no regex/compiler phase mapping (i.e. no static initializer block that adds a mapping entry to IRNode.IR_NODE_MAPPINGS).
   Have you just created the entry "_#CMP_U\b#_" in class IRNode and forgot to add a mapping?
   Violation for IR rule 1 at public static boolean compiler.c2.gvn.TestBoolNodeGVN.testShouldHaveCpmUCase1(int,int).
 - [repeated violations omitted]

I think it's the second argument to beforeMatchingNameRegex(irNodePlaceholder, irNodeRegex) you want to add the word break to. Not the placeholder. This can only be done in IRNode.java. I propose the following change:

 public static final String CMP_U = PREFIX + "CMP_U" + POSTFIX; 
 static { 
-     beforeMatchingNameRegex(CMP_U, "CmpU"); 
+     beforeMatchingNameRegex(CMP_U, "CmpU\\b"); 
 } 

The three existing tests currently referencing IRNode.CMP_U: compiler.c2.irTests.CmpUWithZero, compiler.intrinsics.TestCompareUnsigned, compiler.c2.irTests.TestUnsignedComparison, are all passing w/o this change. It does not break existing tests.

I'm going to push a commit to do so. If you think it's not appropriate to change IRNode.java with the scope of this issue, I can revert it.

tabjy avatar Aug 26 '24 17:08 tabjy

Hi @tabjy, no worries! You're right, you cannot just append the \\b as suggested above - this was only possible in an older version of the IR framework. I think going with what you suggested with beforeMatchingNameRegex(CMP_U, "CmpU\\b") should do the trick. You can go ahead and push that. Then I can run some internal testing again, just to be sure.

chhagedorn avatar Aug 27 '24 07:08 chhagedorn

@chhagedorn It's already pushed. The HEAD has the up to date master merged in. Please let me know how the test goes. Thanks!

tabjy avatar Aug 27 '24 07:08 tabjy

Right, missed that. Testing is submitted, I report back once it's complete

chhagedorn avatar Aug 27 '24 07:08 chhagedorn