Releases: sosy-lab/benchexec
Release 1.16
- Support for energy measurements if cpu-energy-meter is installed.
- Several small bug fixes and improvements
Release 1.14
- Updated tool-info modules for all participants of SV-COMP'18.
- Extended support for variable replacements in table-definitions of table-generator.
Release 1.13
- For Debian/Ubuntu, the
.deb
package is now the recommended way of installation,
because it automatically configures cgroups as necessary. - BenchExec now automatically attempts to use the sub-cgroup
system.slice/benchexec-cgroup.service
if it does not have access to the current cgroup.
This means that if you followed our installation instructions
for systems with systemd, there is no need anymore to manually
put your shell into the correct cgroup. - Several smaller bug fixes for table-generator:
#249, #250, #259, #260, #271, #272 - For users of the Python API of RunExecutor,
different file names can now be specified for stdout and stderr of the tool. - Some new tool-info modules and updates for SV-COMP'18.
Release 1.12
- Fix execution of runs specified with
<withoutfile>
tags
in the benchmark definition: the name of the run was missing
from the command-line in BenchExec 1.11.
Release 1.11
table-generator
can now be given result XML files as arguments in addition to a table-definition XML file (with parameter-x
).
In this case, it will use the column definitions from the latter for tables with the separately given results.- The directory
contrib
of the repository now contains a scriptstatistics-tex.py
, which can export summary data for benchmark results (e.g., number of solved tasks, average CPU time, etc.) to LaTeX. - The dummy tools
true
andfalse
, which could be used for testing a BenchExec installation, are replaced with a more generic dummy tool calleddummy
. - A few minor bug fixes and performance optimizations.
A new paper about BenchExec called Reliable Benchmarking: Requirements and Solutions is now available.
Please note that support for Python 3.2 and 3.3 is deprecated.
Furthermore, the support for "sudo mode" (parameter --user
/--users
) is also deprecated.
All deprecated features will be removed in BenchExec 2.0.
Release 1.10
This release brings several smaller and medium-sized features:
- Tool-info modules for all participants of SV-COMP'17,
and support for results of the categorycorrect-unconfirmed
,
which is used by SV-COMP if witness validation was not successful.
To conform with SV-COMP's definitions, violations of the SV-COMP reachability propertyunreach-call
will now be reported asfalse(unreach-call)
instead offalse(reach)
. - Measurement of block I/O if the
blkio
cgroup controller is available
(experimental, please read the documentation!). - Measurement of the energy used by the CPU for a run,
if the tool cpu-energy-meter is installed on the system
(experimental, please read the documentation!). - Ability to limit the disk space a tool can occupy in container mode.
- Various minor improvements to make container mode more robust.
- The feature for executing benchmarks under different user accounts with sudo
is now marked as deprecated and may be removed in the future,
consider using the container mode instead for isolating runs
(cf. issue #215). table-generator
is now more flexible:- Builtin support for certain unit conversions,
such that the scale factor does not always need to be explicitly specified.
Furthermore, unit conversions now work even if the values already have a unit. - Column titles can be manually specified with the
displayTitle
attribute`. - What columns are relevant for the "diff" table can be configured.
- Builtin support for certain unit conversions,
Please also note that we are considering dropping the support for Python 3.2
and maybe 3.3 in BenchExec 2.0 (to be released in a few weeks).
If this is a problem for you, please tell us in issue #207.
Release 1.9
The main feature of this release is the addition of a container mode that allows to isolate runs from each other and from the host, for example preventing filesystem and network accesses. It also allows to collect and store all files created by the tool in a run. The container mode is still in beta and disabled by default for now, it will be enabled by default in BenchExec 2.0. Please try it out and tell us your experiences!
Further changes:
table-generator
now supports HTTP(S) URLs to be given for result XML files
to allow generating tables for results without needing to download them first.
The HTML tables will contain correct links to the log files.- New SV-COMP property deadlock supported by
benchexec
. - The parameters
--rundefinition
and--tasks
ofbenchexec
now support wildcards. - Rounding of very small and very large values in
table-generator
has been fixed. - The default font for HTML tables has changed,
it is now a font that supports correctly aligned digits.
Release 1.8
benchexec
now compresses results by default: XML result files
are compressed with BZip2, and log files are stored within a ZIP archive.
This can reduce the necessary disk space significantly
(typically these logs compress very well),
and for large benchmark sets it reduces the number of necessary files,
which can make dealing with the results much faster.
The previous behavior can be restored with the parameter--no-compress-results
.table-generator
now supports benchmark results where the log files
are stored in a ZIP file instead of a regular directory.
All features continue to work with compressed results,
including extraction of values from log files and viewing log files from HTML tables
(cf. table-generator documentation for more details).
Compressed and uncompressed results are handled transparently and can be mixed,
and using results that were manually compressed or decompressed
is also supported.
Release 1.7
- Fix
table-generator
behavior for columns where different cells have different units:
The release notes for 1.6 claimed that these columns are treated as text column,
when instead they were rejected. Now they are treated as text.
Note that BenchExec does not create such columns itself, so this should not affect most users. - Fix computation of scores according to the SV-COMP scoring scheme:
if the expected result is for examplefalse(valid-deref)
and the tool returnsfalse(valid-free)
,
the resulting score is the one for a wrong false answer (-16 points),
not the one for a wrong true answer (-32 points).
The latter score is only given if the tool actually answerstrue
incorrectly. - Change result classification, if the returned answer does not belong to the property of the task,
for example, if the tool returnstrue
instead ofsat
for a task with categorysatisfiability
,
or if the tool returnsfalse(no-overflow)
when it should not even check for overflows.
Now these results are classified as unknown (with score 0),
previously these were treated as wrong answers. - Fix escaping of links in HTML tables, e.g., to log files with special characters in their name.
This was broken in 1.6.
Release 1.6
This release brings several improvements to table-generator
:
table-generator
now rounds measurement values in a scientifically correct way,
i.e., with a fixed number of significant digits, not with a fixed number of decimal places.
The attributenumberOfDigits
of<column>
tags in table-definition files
now also specifies significant digits, not decimal places.
By default, in HTML tables all fractional values are now rounded (e.g., time measurements)
and all integer values continue without rounding (e.g., memory measurements),
previously only "time" columns were rounded.
The remaining rounding-related behavior stays unchanged:
In CSV tables, values are not rounded by default,
and ifnumberOfDigits
is explicitly given for a column,
it's value will always be rounded in both HTML and CSV tables.table-generator
now automatically extracts units from the cells in a column
and puts them into the table header.- In HTML tables, numeric values are now aligned at the decimal point,
and text values are left aligned (previously both were right aligned). table-generator
now allows to convert values from one unit into another.
So far this is only implemented for values that do not have a unit attached to them,
and both the target unit and the scale factor need to be specified explicitly
in the<column>
tag.
This can be used for example to show memory measurements in MB instead of Bytes in tables.table-generator
now allows columns with links to arbitrary files to be added to tables.table-generator
does not handle columns where cells have differing units wrongly anymore.
Previously, the unit was simply dropped, leading to wrong values for statistics.
Now such columns are treated as text and no statistics are generated.
(Note that BenchExec never creates such columns by itself,
only if values are extracted from the tool output this could happen).
Other changes:
- The behavior of
benchexec --timelimit
was changed slightly,
if a value forhardtimelimit
was given in the benchmark-definition file.
If a time limit is specified on the command line, this now overrides both soft and hard time limit. - Implementation of tool-info modules got easier because the
test_tool_info
helper got improved
(it now allows to test the function for extracting results from tool outputs). - Several tool-info modules of tools participating in SV-COMP got improved.
- Simplified cgroups setup for systemd systems.
- Improved documentation.