bench runner can run external command #1028

grisumbras · 2024-07-19T12:37:52Z

The idea is that you can run an external command from the benchmark. The command is communicated via an environment variable. E.g. in bash:

BOOST_JSON_BENCH_EXTERNAL_COMMAND="./1.sh" ./bench -i:b

The bench runner invokes the command with these arguments:

start or finish
toolset name (e.g. gcc, clang)
the name of the processed file.

For example, here's a script that does what was suggested in the issue:

#!/bin/sh

op=Staring
if [ "$1" = finish ]; then
    op=Finished
fi
shift

compiler=$1
shift

date +'%Y/%m/%d %H:%M:%S:%3N' >> /tmp/json-benchmarks.log
echo "$op to process $* with $compiler" >> /tmp/json-benchmarks.log
lscpu -e >> /tmp/json-benchmarks.log
top -b -n 1 | head -n 20 >> /tmp/json-benchmarks.log

cppalliance-bot · 2024-07-19T12:47:23Z

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

sdarwin · 2024-07-19T13:03:08Z

Hi @grisumbras , thanks!

There were two ideas in the issue with the newer, later idea being a log file. Upon further consideration I think that would have been preferable, having output nearly identical to the benchmark output, which makes correlating the logs and the benches as easy as possible.

Well, in this case, I will need to generate the log file.

By designing a "./1.sh".

Can 1.sh generate this output, distinguishing Serialize and Parse, and including the bench results times?

2024-07-18-08:10:32:725 Starting Parse mesh.json,clang x64/sse2,boost (pool)
2024-07-18-08:10:33:822 Completed Parse mesh.json,clang x64/sse2,boost (pool),3200,5006,441
2024-07-18-08:10:34:321 Starting Parse mesh.json,clang x64/sse2,boost (pool)
2024-07-18-08:10:35:425 Completed Parse mesh.json,clang x64/sse2,boost (pool),3339,5225,441

The bench code is/was already writing "Parse mesh.json,clang x64/sse2,boost (pool),3200,5006,441" . Since it has that information, is it available as a string, and passed to "./1.sh"?

grisumbras · 2024-07-19T13:11:30Z

Yeah, I think it should be possible.

codecov · 2024-07-19T13:12:10Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 93.40%. Comparing base (baff1cb) to head (365b46e).

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #1028   +/-   ##
========================================
  Coverage    93.40%   93.40%           
========================================
  Files           87       87           
  Lines         8622     8622           
========================================
  Hits          8053     8053           
  Misses         569      569

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update baff1cb...365b46e. Read the comment docs.

sdarwin · 2024-07-19T13:19:49Z

The debugging commands lscpu and top ought to be run every second, constantly, during the benchmark tests. Therefore, it becomes less critical to run them before and after each bench. They must also run during the bench. The way to implement that is to have them totally outside of bench. (...I have composed another bash loop script).

for the moment, the current most interesting goal is to generate a log file that looks as similar to the bench output as possible. (but adds timestamps. start/stop.)

grisumbras · 2024-07-19T13:57:24Z

With the new version the accomapying script would be

#!/bin/sh
echo >>/tmp/json-benchmarks.log $(date +'%Y/%m/%d %H:%M:%S:%3N') "$@"

cppalliance-bot · 2024-07-19T14:02:23Z

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

sdarwin · 2024-07-19T14:06:12Z

Great!! I will test it.

cppalliance-bot · 2024-07-19T14:12:58Z

An automated preview of the documentation is available at https://1028.jsondocs.prtest.cppalliance.org/libs/json/doc/html/index.html

sdarwin · 2024-07-19T15:43:01Z

/tmp/benchmark_external_script.sh

#!/bin/bash

touch external_script_is_working.txt
touch /tmp/external_script_is_working.txt

echo >>/tmp/json-benchmarks.log $(date +'%Y/%m/%d %H:%M:%S:%3N') "$@"

Bench command:

BOOST_JSON_BENCH_EXTERNAL_COMMAND=/tmp/benchmark_external_script.sh ./bench -i:b *

Quotes are missing around /tmp/benchmark_external_script.sh to avoid escaping quotes through multiple scripts. However that shouldn't affect things.

Result:

The files external_script_is_working.txt and json-benchmarks.log do not appear anywhere in the filesystem.

Locally, could you try to replicate this using /tmp/benchmark_external_script.sh (not ./benchmark_external_script.sh) and see what happens?

grisumbras · 2024-07-19T16:07:28Z

Everything works locally for me. Please check if the script has executable flag set.

sdarwin · 2024-07-19T16:14:02Z

Ok thanks. I think I know what it is.

cppalliance-bot · 2024-07-19T16:23:39Z

Benchmark test results. More info at https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1028/pullrequest.html

grisumbras · 2024-07-19T16:32:52Z

Suddenly slow canada.json strikes again!

sdarwin · 2024-07-19T18:47:18Z

Compare the numerical results at the end of the line.

Parse apache_builds.json,clang x64/sse2,boost (pool),15000,5039,361
Parse apache_builds.json,clang x64/sse2,boost (pool),15624,5221,363
Parse apache_builds.json,clang x64/sse2,boost (pool),15708,5245,364
Parse apache_builds.json,clang x64/sse2,boost (pool),14960,5020,362
Parse apache_builds.json,clang x64/sse2,boost (pool),15645,5237,363
Parse apache_builds.json,clang x64/sse2,boost (pool),15666,5247,362

/tmp/json-benchmarks.log

2024/07/19 16:56:59:225 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:04:271 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),15,5039,0
2024/07/19 16:57:04:276 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:09:503 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5221,0
2024/07/19 16:57:09:508 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:14:760 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5245,0
2024/07/19 16:57:14:765 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:19:791 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),20,5020,0
2024/07/19 16:57:19:796 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:25:039 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5237,0
2024/07/19 16:57:25:044 Starting Parse apache_builds.json,clang x64/sse2,boost (pool)
2024/07/19 16:57:30:297 Completed Parse apache_builds.json,clang x64/sse2,boost (pool),21,5247,0

The current benchmarks might be lucky to have a chipset from 2010. The new machine exhibits more randomness. This is the problem -

https://www.kernel.org/doc/Documentation/cpu-freq/intel-pstate.txt

For contemporary Intel processors, the frequency is controlled by the
processor itself and the P-State exposed to software is related to
performance levels.  The idea that frequency can be set to a single
frequency is fictional for Intel Core processors. Even if the scaling
driver selects a single P-State, the actual frequency the processor
will run at is selected by the processor itself.

grisumbras · 2024-07-25T20:34:05Z

To be honest, this doesn't explain to me the canada.json phenomenon. Or this PR wasn't an attempt to investigate it?

sdarwin · 2024-07-25T20:45:33Z

This PR was an attempt to analyze and debug the performance of a new server, in general. Not specific to canada.json.
A benchmark machine should be as stable as possible, with replicable benchmarks.

Now there is a new server. So far it has been mostly consistent, with variability in the twitter data set. If that continues it could be worthwhile to examine the data and try to figure out why.

sdarwin · 2024-07-25T22:57:40Z

Something else affecting the current benchmarks is that new machine is running Ubuntu 22.04 and clang-14. As soon as there is a supported upgrade path to Ubuntu 24.04 and clang-18 I will upgrade. That could improve the results. ( "It is recommended to wait until the release of 24.04.1 LTS (scheduled for August 15th, 2024) before upgrading from Ubuntu 22.04 LTS." )

grisumbras · 2024-07-26T04:45:15Z

Interesting. It looks like on this new machine it doesn't like parsing twitter.json with clang.
https://benchmark.cppalliance.org/jsonbenchmarks-pullrequests/1031/pullrequest.html

sdarwin · 2024-07-26T09:36:23Z

It looks like on this new machine it doesn't like parsing twitter.json with clang.

Right. The results have a lot of randomization. Either red or green is not important there.
However, the other machine, an i5 processor, didn't get that. What was different? (kernel, clang, cpu). That's why the next OS upgrade could be informative. Although "remove unnecessary branch in string serialization" seems to have an affect also.

grisumbras force-pushed the bench/exec-external branch from 5ff581c to 4ca398f Compare July 19, 2024 13:56

bench runner can run external command

365b46e

grisumbras force-pushed the bench/exec-external branch from 4ca398f to 365b46e Compare July 19, 2024 13:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bench runner can run external command #1028

bench runner can run external command #1028

grisumbras commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

codecov bot commented Jul 19, 2024 •

edited

Loading

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

sdarwin commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

grisumbras commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 25, 2024

sdarwin commented Jul 25, 2024 •

edited

Loading

sdarwin commented Jul 25, 2024

grisumbras commented Jul 26, 2024

sdarwin commented Jul 26, 2024

bench runner can run external command #1028

Are you sure you want to change the base?

bench runner can run external command #1028

Conversation

grisumbras commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

codecov bot commented Jul 19, 2024 • edited Loading

Codecov Report

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 19, 2024

sdarwin commented Jul 19, 2024

cppalliance-bot commented Jul 19, 2024

grisumbras commented Jul 19, 2024

sdarwin commented Jul 19, 2024

grisumbras commented Jul 25, 2024

sdarwin commented Jul 25, 2024 • edited Loading

sdarwin commented Jul 25, 2024

grisumbras commented Jul 26, 2024

sdarwin commented Jul 26, 2024

codecov bot commented Jul 19, 2024 •

edited

Loading

sdarwin commented Jul 25, 2024 •

edited

Loading