Triton Reduce usage of concrete evaluation in x86 semantics

Hi @JonathanSalwan,

In x86semantics.cpp, there are 46 instruction handlers where the emitted ASTs are dependent upon a concrete value retrieved through .evaluate():

[ ] x86Semantics::cmpsb_s
[ ] x86Semantics::cmpsd_s
[ ] x86Semantics::cmpsq_s
[ ] x86Semantics::cmpxchg_s
[ ] x86Semantics::cmpxchg8b_s
[ ] x86Semantics::cpuid_s
[ ] x86Semantics::div_s
[ ] x86Semantics::idiv_s
[ ] x86Semantics::lodsb_s
[ ] x86Semantics::lodsd_s
[ ] x86Semantics::lodsq_s
[ ] x86Semantics::lodsw_s
[ ] x86Semantics::movsb_s
[ ] x86Semantics::movsd_s
[ ] x86Semantics::movsq_s
[ ] x86Semantics::movsw_s
[ ] x86Semantics::pextrb_s
[ ] x86Semantics::pextrd_s
[ ] x86Semantics::pextrq_s
[ ] x86Semantics::pextrw_s
[ ] x86Semantics::pinsrb_s
[ ] x86Semantics::pinsrd_s
[ ] x86Semantics::pinsrq_s
[ ] x86Semantics::pinsrw_s
[ ] x86Semantics::rcl_s
[ ] x86Semantics::rcr_s
[ ] x86Semantics::rol_s
[ ] x86Semantics::ror_s
[ ] x86Semantics::sar_s
[ ] x86Semantics::scasb_s
[ ] x86Semantics::scasd_s
[ ] x86Semantics::scasq_s
[ ] x86Semantics::scasw_s
[ ] x86Semantics::shl_s
[ ] x86Semantics::shld_s
[ ] x86Semantics::shr_s
[ ] x86Semantics::shrd_s
[ ] x86Semantics::stosb_s
[ ] x86Semantics::stosd_s
[ ] x86Semantics::stosq_s
[ ] x86Semantics::stosw_s
[ ] x86Semantics::vextracti128_s
[ ] x86Semantics::vperm2i128_s
[ ] x86Semantics::vpextrb_s
[ ] x86Semantics::vpextrq_s
[ ] x86Semantics::vpextrw_s

There are some similar cases(push/pushfq/pop/popfq, fxsave/fxrstor, ret, etc) involving symbolic memory, but IMO they're out of the scope of this issue since Triton does not have STORE/LOAD ast nodes.

One step towards https://github.com/JonathanSalwan/Triton/issues/473 would be to replace .evaluate() usages with code to emit AST nodes which depend on a symbolic value. Any objections to me opening a PR for this?

You can break the usages of .evaluate() into four categories:

String instructions (e.g. scasb) where the concrete value of cx is used to determine whether any nodes should be emitted
Exception raisers(e.g. div_s) where the exception variable is set for cases where x86 would raise an exception
General cases(e.g. pinsrb_s, vextracti128_s) where I'm not sure why a concrete value is used instead of a symbolic value
Conditional undefines of a register(e.g. rol_s where of is set to an undefined value if the src operand is greater than 1)

For these categories:

An ITE node (e.g. 'dst = ite(cx == 0, original_value, new_value)` could be used instead
Exception raising is probably out of the scope of this issue, since the IR has no way of modeling exception raising. I would leave this as is.
A symbolic value could be used
An ITE node(e.g. of = ite(src > 1, undef, of)) could be used

Apr 12 '23 07:04 Colton1skees

Won't this break the dynamic symbolic execution approach where we make the constraints based on the path that was actually executed?

Apr 12 '23 08:04 SweetVishnya

Anyway, if we decide to proceed with the PR, we have local DSE benchmarks to measure symbolic execution accuracy and speed.

Apr 12 '23 08:04 SweetVishnya

Won't this break the dynamic symbolic execution approach where we make the constraints based on the path that was actually executed?

@SweetVishnya Can you give a case example? The changes to handling undefined flags should be fine I think - the only case I can see a potential problem with is the string related instructions

Apr 12 '23 08:04 Colton1skees

Won't this break the dynamic symbolic execution approach where we make the constraints based on the path that was actually executed?

@SweetVishnya Can you give a case example? The changes to handling undefined flags should be fine I think - the only case I can see a potential problem with is the string related instructions

Yeah, now Triton decides the number of times to execute REP string instructions.

Apr 12 '23 08:04 SweetVishnya

Maybe, we should split this PR by instruction groups. I can benchmark symbolic execution accuracy on our side. And we can merge succeeding PRs one by one.

Apr 12 '23 08:04 SweetVishnya

@SweetVishnya Take stosb as an example.

With the change I suggested, this code:

/* Create symbolic expression */
auto expr1 = this->symbolicEngine->createSymbolicExpression(inst, node1, dst, "STOSD operation");
auto expr2 = this->symbolicEngine->createSymbolicExpression(inst, node2, index, "Index operation");

would be transformed into something along the lines of this(in pseudo code)

/* Create symbolic expression */
auto src1 = ITE(cx != 0,  node1, getOperandAst(dst))
auto src2 = ITE(cx != 0,  node2, getOperandAst(index))

auto expr1 = this->symbolicEngine->createSymbolicExpression(inst, src1, dst, "STOSD operation");
auto expr2 = this->symbolicEngine->createSymbolicExpression(inst, src2, index, "Index operation");

Essentially both destinations are only being assigned a new value if the counter(cx) is not set to zero. The logic for computing the next RIP should remain unchanged. With this context, do you still think this may cause problems?

Apr 12 '23 08:04 Colton1skees

@Colton1skees, I do not see the problem here. But you never know what break symbolic reasoning in real world. We may try to benchmark all string instructions in a separate PR. If everything is OK, we can merge it. More info about our benchmarking can be found in Evaluation section.

Apr 12 '23 09:04 SweetVishnya

So far for this stosb instruction, I see no problem. What I suggest is to make one PR per each instruction with a bunch of unit tests to be sure we do not break anything. So that it's easy for me to merge ones that are working perfectly and to discuss ones that could break things.

Apr 12 '23 09:04 JonathanSalwan

In a general point of view, removing all evaluate() from the instruction semantics could be a step forward to solve #473.

Apr 12 '23 09:04 JonathanSalwan

So far for this stosb instruction, I see no problem. What I suggest is to make one PR per each instruction with a bunch of unit tests to be sure we do not break anything. So that it's easy for me to merge ones that are working perfectly and to discuss ones that could break things.

One PR for each instruction can be an overkill. Maybe, we can group them somehow?

Apr 12 '23 09:04 SweetVishnya

Yeah I think grouping them would be ok :)

Apr 12 '23 09:04 JonathanSalwan

Moreover, running one benchmark requires one night of a time ;)

Apr 12 '23 09:04 SweetVishnya

Maybe, we should split this PR by instruction groups.

Agreed, #1/#3/#4 should be split into separate. grouped PRs/

But you never know what break symbolic reasoning in real world. We may try to benchmark all string instructions in a separate PR. If everything is OK, we can merge it. More info about our benchmarking can be found in Evaluation section.

Understood. If I have time to PR this, I'll try to put together some relatively exhaustive tests.

Also, is there any existing DSE accuracy benchmark which I can run locally? Afaik Sydr is closed source

Apr 12 '23 09:04 Colton1skees

@Colton1skees, unfortunately our benchmark is closed source( because it requires Sydr. We just open source the benchmark binaries.

Apr 12 '23 09:04 SweetVishnya

In a general point of view, removing all evaluate() from the instruction semantics could be a step forward to solve https://github.com/JonathanSalwan/Triton/issues/473.

I have a .NET port of Triton(mostly the translator / semantics) here which has a standalone static version of Triton's IR translator(including support for symbolic memory with STORE/LOAD nodes). On the topic of #473, I wonder how hard it would be to try and update the rest of the Triton codebase to handle the addition of STORE/LOAD nodes.

Apr 12 '23 09:04 Colton1skees

@Colton1skees, I also did some experiments with symbolic pointers long time ago here.

Apr 12 '23 09:04 SweetVishnya

STORE/LOAD nodes are already implemented (#1185) and we have modes for this now:

https://github.com/JonathanSalwan/Triton/blob/master/src/libtriton/includes/triton/modesEnums.hpp#L34
https://github.com/JonathanSalwan/Triton/blob/master/src/libtriton/includes/triton/modesEnums.hpp#L39-L40

Some examples here:

https://github.com/JonathanSalwan/Triton/blob/master/src/testers/unittests/test_symbolic_array.py

However, i'm pretty sure we can do better to improve how we deal with memory array :)

Apr 12 '23 09:04 JonathanSalwan

@Colton1skees, also writing C# bindings for Triton would be easier than copy-pasting the code.

Apr 12 '23 09:04 SweetVishnya

I also did some experiments with symbolic pointers long time ago here.

Neat

also writing C# bindings for Triton would be easier than copy-pasting the code.

If I linked a complete .NET port of Triton, then yes writing bindings would make sense. The repo I linked is basically just a standalone extraction of the IR translator, with some changes specific to my use case(e.g. never emitting reads/writes to concrete memory addresses, switching from capstone to ICED). Writings bindings wasn't necessary.

Apr 12 '23 11:04 Colton1skees

Triton Triton copied to clipboard

Reduce usage of concrete evaluation in x86 semantics

Triton
Triton copied to clipboard