json bench runner can run external command

Fix #1026.

The idea is that you can run an external command from the benchmark. The command is communicated via an environment variable. E.g. in bash:

BOOST_JSON_BENCH_EXTERNAL_COMMAND="./1.sh" ./bench -i:b

The bench runner invokes the command with these arguments:

start or finish
toolset name (e.g. gcc, clang)
the name of the processed file.

For example, here's a script that does what was suggested in the issue:

#!/bin/sh

op=Staring
if [ "$1" = finish ]; then
    op=Finished
fi
shift

compiler=$1
shift

date +'%Y/%m/%d %H:%M:%S:%3N' >> /tmp/json-benchmarks.log
echo "$op to process $* with $compiler" >> /tmp/json-benchmarks.log
lscpu -e >> /tmp/json-benchmarks.log
top -b -n 1 | head -n 20 >> /tmp/json-benchmarks.log

Jul 19 '24 12:07 grisumbras

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

Jul 19 '24 12:07 cppalliance-bot

Hi @grisumbras , thanks!

There were two ideas in the issue with the newer, later idea being a log file. Upon further consideration I think that would have been preferable, having output nearly identical to the benchmark output, which makes correlating the logs and the benches as easy as possible.

Well, in this case, I will need to generate the log file.

By designing a "./1.sh".

Can 1.sh generate this output, distinguishing Serialize and Parse, and including the bench results times?

2024-07-18-08:10:32:725 Starting Parse mesh.json,clang x64/sse2,boost (pool)
2024-07-18-08:10:33:822 Completed Parse mesh.json,clang x64/sse2,boost (pool),3200,5006,441
2024-07-18-08:10:34:321 Starting Parse mesh.json,clang x64/sse2,boost (pool)
2024-07-18-08:10:35:425 Completed Parse mesh.json,clang x64/sse2,boost (pool),3339,5225,441

The bench code is/was already writing "Parse mesh.json,clang x64/sse2,boost (pool),3200,5006,441" . Since it has that information, is it available as a string, and passed to "./1.sh"?

Jul 19 '24 13:07 sdarwin

Yeah, I think it should be possible.

Jul 19 '24 13:07 grisumbras

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 93.40%. Comparing base (baff1cb) to head (365b46e). Report is 56 commits behind head on develop.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #1028   +/-   ##
========================================
  Coverage    93.40%   93.40%           
========================================
  Files           87       87           
  Lines         8622     8622           
========================================
  Hits          8053     8053           
  Misses         569      569

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update baff1cb...365b46e. Read the comment docs.

Jul 19 '24 13:07 codecov[bot]

The debugging commands lscpu and top ought to be run every second, constantly, during the benchmark tests. Therefore, it becomes less critical to run them before and after each bench. They must also run during the bench. The way to implement that is to have them totally outside of bench. (...I have composed another bash loop script).

for the moment, the current most interesting goal is to generate a log file that looks as similar to the bench output as possible. (but adds timestamps. start/stop.)

Jul 19 '24 13:07 sdarwin

With the new version the accomapying script would be

#!/bin/sh
echo >>/tmp/json-benchmarks.log $(date +'%Y/%m/%d %H:%M:%S:%3N') "$@"

Jul 19 '24 13:07 grisumbras

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

Jul 19 '24 14:07 cppalliance-bot

Great!! I will test it.

Jul 19 '24 14:07 sdarwin

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

Jul 19 '24 14:07 cppalliance-bot

/tmp/benchmark_external_script.sh

#!/bin/bash

touch external_script_is_working.txt
touch /tmp/external_script_is_working.txt

echo >>/tmp/json-benchmarks.log $(date +'%Y/%m/%d %H:%M:%S:%3N') "$@"

Bench command:

BOOST_JSON_BENCH_EXTERNAL_COMMAND=/tmp/benchmark_external_script.sh ./bench -i:b *

Quotes are missing around /tmp/benchmark_external_script.sh to avoid escaping quotes through multiple scripts. However that shouldn't affect things.

Result:

The files external_script_is_working.txt and json-benchmarks.log do not appear anywhere in the filesystem.

Locally, could you try to replicate this using /tmp/benchmark_external_script.sh (not ./benchmark_external_script.sh) and see what happens?

Jul 19 '24 15:07 sdarwin

Everything works locally for me. Please check if the script has executable flag set.

Jul 19 '24 16:07 grisumbras

Ok thanks. I think I know what it is.

Jul 19 '24 16:07 sdarwin

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1028/pullrequest.html

Jul 19 '24 16:07 cppalliance-bot

Suddenly slow canada.json strikes again!

Jul 19 '24 16:07 grisumbras

Compare the numerical results at the end of the line.

Parse apache_builds.json,clang x64/sse2,boost (pool),15000,5039,361
Parse apache_builds.json,clang x64/sse2,boost (pool),15624,5221,363
Parse apache_builds.json,clang x64/sse2,boost (pool),15708,5245,364
Parse apache_builds.json,clang x64/sse2,boost (pool),14960,5020,362
Parse apache_builds.json,clang x64/sse2,boost (pool),15645,5237,363
Parse apache_builds.json,clang x64/sse2,boost (pool),15666,5247,362

/tmp/json-benchmarks.log

2024/07/19 16:56:59:225 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:04:271 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),15,5039,0
2024/07/19 16:57:04:276 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:09:503 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5221,0
2024/07/19 16:57:09:508 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:14:760 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5245,0
2024/07/19 16:57:14:765 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:19:791 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),20,5020,0
2024/07/19 16:57:19:796 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:25:039 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5237,0
2024/07/19 16:57:25:044 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:30:297 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5247,0

The current benchmarks might be lucky to have a chipset from 2010. The new machine exhibits more randomness. This is the problem -

https://www.kernel.org/doc/Documentation/cpu-freq/intel-pstate.txt

For contemporary Intel processors, the frequency is controlled by the
processor itself and the P-State exposed to software is related to
performance levels.  The idea that frequency can be set to a single
frequency is fictional for Intel Core processors. Even if the scaling
driver selects a single P-State, the actual frequency the processor
will run at is selected by the processor itself.

Jul 19 '24 18:07 sdarwin

To be honest, this doesn't explain to me the canada.json phenomenon. Or this PR wasn't an attempt to investigate it?

Jul 25 '24 20:07 grisumbras

This PR was an attempt to analyze and debug the performance of a new server, in general. Not specific to canada.json.
A benchmark machine should be as stable as possible, with replicable benchmarks.

Now there is a new server. So far it has been mostly consistent, with variability in the twitter data set. If that continues it could be worthwhile to examine the data and try to figure out why.

Jul 25 '24 20:07 sdarwin

Something else affecting the current benchmarks is that new machine is running Ubuntu 22.04 and clang-14. As soon as there is a supported upgrade path to Ubuntu 24.04 and clang-18 I will upgrade. That could improve the results. ( "It is recommended to wait until the release of 24.04.1 LTS (scheduled for August 15th, 2024) before upgrading from Ubuntu 22.04 LTS." )

Jul 25 '24 22:07 sdarwin

Interesting. It looks like on this new machine it doesn't like parsing twitter.json with clang. https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1031/pullrequest.html

Jul 26 '24 04:07 grisumbras

It looks like on this new machine it doesn't like parsing twitter.json with clang.

Right. The results have a lot of randomization. Either red or green is not important there.
However, the other machine, an i5 processor, didn't get that. What was different? (kernel, clang, cpu). That's why the next OS upgrade could be informative. Although "remove unnecessary branch in string serialization" seems to have an affect also.

Jul 26 '24 09:07 sdarwin