staq Optimization by self-annihilating sequences

I use two test cases to illustrate an issue.

Case 1 - cx gates

OPENQASM 2.0;
include "qelib1.inc";

qreg q[2];
h q;

cx q[0], q[1];
cx q[0], q[1];
cx q[0], q[1];
cx q[0], q[1];

After staq with optimization:

OPENQASM 2.0;
include "qelib1.inc";

qreg q[2];
h q[0];
h q[1];

The cx gates have been cancelled nicely.

Case 2 - ccx gates

OPENQASM 2.0;
include "qelib1.inc";

qreg q[3];
h q;

ccx q[0], q[1], q[2];
ccx q[0], q[1], q[2];
ccx q[0], q[1], q[2];
ccx q[0], q[1], q[2];

After staq with optimization:

OPENQASM 2.0;
include "qelib1.inc";

qreg q[3];
h q[0];
h q[1];
z q[2];
z q[1];
cx q[2],q[1];
rz(((pi*3)/2)+((pi*3)/2)) q[1];
s q[0];
cx q[2],q[0];
rz(((pi*3)/2)+((pi*3)/2)) q[0];
cx q[1],q[0];
sdg q[0];
cx q[2],q[0];
z q[0];
cx q[2],q[1];
cx q[2],q[0];
cx q[1],q[0];
h q[2];
s q[0];
cx q[1],q[0];
sdg q[0];
cx q[1],q[0];

The ccx gates should also cancel each other out, shouldn't they?

I read an old article, which may not be directly related to what the staq wants to achieve. Nevertheless, the authors introduced an interesting idea of removing self-annihilating sequences. I am wondering if staq could borrow the idea and repurpose it to optimize similar sequences for staq applications.

In the classical world, a decent modern IDE can use static analysis to identify a lot of redundant code, dead code, suspicious code, no-op blocks, obvious stupidity, etc., before the complier is run. Do you think if staq can introduce a pass to target those usual suspects as well, especially the self-annihilating sequences? In the long run, that may even help the IDE vendors.

Feb 02 '21 02:02 DevelopDaily

@DevelopDaily Absolutely it would be nice to have this optimization (and others)! Curiously, I thought we already had something that would do this in staq. I just had a look at the code in /include/optimization/simplify.hpp and it looks like was no rule for ccx gates. I just added a rule and pushed an update, the output for the ccx version I'm getting now is (as expected)

OPENQASM 2.0;
include "qelib1.inc";

qreg q[3];
h q[0];
h q[1];
h q[2];

Thanks for pointing this out!

In general we're always looking to get more optimizations into staq, and Vlad and I both do research in quantum compiler optimizations. I've got a few in the works that will hopefully be added into staq in the future, but new ideas are always helpful too!

Feb 02 '21 23:02 meamy

Indeed, ccx works well now. Thanks!

I'm glad you guys are doing research constantly in this area. I hope staq will be able to trim the fat from all kinds of obese code eventually.

Since ccx simplification is implemented with a rule-based approach, it won't be able to deal with cccx or gates with arbitrary number of control bits. So, I will post a test case for cccx here for the record. One day, I will come back to test it:-)

OPENQASM 2.0;
include "qelib1.inc";

gate cccx ctrl_0, ctrl_1, ctrl_2, q0
{
    s q0;
    h q0;
    t q0;
    cx ctrl_0,q0;
    tdg q0;
    h q0;
    rz((pi/4)/2) ctrl_2;
    t ctrl_0;
    cx ctrl_2,ctrl_0;
    rz(-(pi/4)/2) ctrl_0;
    rz(((pi*3)/2)+((pi/4)/2)) q0;
    cx ctrl_0,q0;
    cx ctrl_2,q0;
    rz(-(pi/4)/2) q0;
    cx ctrl_2,ctrl_0;
    cx ctrl_0,q0;
    h q0;
    t q0;
    cx ctrl_2,q0;
    tdg q0;
    cx ctrl_0,q0;
    t q0;
    cx ctrl_2,q0;
    tdg q0;
    cx ctrl_0,q0;
    h q0;
    rz((-pi/4)/2) q0;
    rz((pi/4)+((-pi/4)/2)) ctrl_0;
    cx ctrl_2,ctrl_0;
    tdg ctrl_0;
    cx q0,ctrl_0;
    cx ctrl_2,ctrl_0;
    rz(-(-pi/4)/2) ctrl_0;
    cx q0,ctrl_0;
    h q0;
    t q0;
    cx ctrl_1,q0;
    tdg q0;
    cx ctrl_0,q0;
    t q0;
    cx ctrl_1,q0;
    tdg q0;
    cx ctrl_0,q0;
    h q0;
    rz((pi/4)/2) q0;
    rz((pi/4)+((pi/4)/2)) ctrl_0;
    cx ctrl_1,ctrl_0;
    tdg ctrl_0;
    cx q0,ctrl_0;
    cx ctrl_1,ctrl_0;
    rz(-(pi/4)/2) ctrl_0;
    cx q0,ctrl_0;
    h q0;
    s ctrl_2;
    t q0;
    cx ctrl_2,q0;
    tdg q0;
    cx ctrl_0,q0;
    t q0;
    cx ctrl_2,q0;
    tdg q0;
    cx ctrl_0,q0;
    h q0;
    rz((-pi/4)/2) q0;
    rz((pi/4)+((-pi/4)/2)) ctrl_0;
    cx ctrl_2,ctrl_0;
    tdg ctrl_0;
    cx q0,ctrl_0;
    cx ctrl_2,ctrl_0;
    rz(-(-pi/4)/2) ctrl_0;
    cx q0,ctrl_0;
    h q0;
    s ctrl_1;
    t q0;
    cx ctrl_1,q0;
    tdg q0;
    cx ctrl_0,q0;
    t q0;
    cx ctrl_1,q0;
    tdg q0;
    cx ctrl_0,q0;
    h q0;
    t ctrl_0;
    cx ctrl_1,ctrl_0;
    tdg ctrl_0;
    s q0;
    cx ctrl_1,ctrl_0;
    h q0;
    t q0;
    cx ctrl_0,q0;
    tdg q0;
    h q0;
    rz((pi/4)/2) ctrl_1;
    rz((pi/4)/2) ctrl_0;
    cx ctrl_1,ctrl_0;
    rz(-(pi/4)/2) ctrl_0;
    sdg q0;
    cx ctrl_1,ctrl_0;
    h ctrl_1;
    t ctrl_1;
    cx ctrl_2,ctrl_1;
    tdg ctrl_1;
    cx ctrl_0,ctrl_1;
    t ctrl_1;
    cx ctrl_2,ctrl_1;
    tdg ctrl_1;
    cx ctrl_0,ctrl_1;
    h ctrl_1;
    rz((-pi/4)/2) ctrl_1;
    rz((pi/4)+((-pi/4)/2)) ctrl_0;
    cx ctrl_2,ctrl_0;
    tdg ctrl_0;
    cx ctrl_1,ctrl_0;
    cx ctrl_2,ctrl_0;
    rz(-(-pi/4)/2) ctrl_0;
    cx ctrl_1,ctrl_0;
    h ctrl_1;
    s ctrl_2;
    t ctrl_1;
    cx ctrl_2,ctrl_1;
    tdg ctrl_1;
    cx ctrl_0,ctrl_1;
    t ctrl_1;
    cx ctrl_2,ctrl_1;
    tdg ctrl_1;
    cx ctrl_0,ctrl_1;
    h ctrl_1;
    t ctrl_0;
    cx ctrl_2,ctrl_0;
    tdg ctrl_0;
    cx ctrl_2,ctrl_0;
}

qreg q[4];

cccx q[0],q[1],q[2],q[3];
cccx q[0],q[1],q[2],q[3];

Feb 03 '21 01:02 DevelopDaily