CPALockator's Linux device driver data race detection benchmarks #3

sim642 · 2021-06-12T10:24:16Z

I recently read the following paper about CPALockator: Analysis of Correct Synchronization of Operating System Components.
Basically CPALockator is a CPAchecker fork/branch/something that also does thread-modular analysis.

In the evaluation section, they mention having constructed a set of 425 data race detection benchmarks from Linux 4.2.6 device drivers. There's no mention of the availability of this benchmark set, but I managed to find this, which seems like it's the same set: https://gitlab.com/sosy-lab/software/ldv-benchmarks/-/tree/master/linux-4.2.6-races.

I wasn't aware of this earlier and I'm not sure why this didn't get added to sv-benchmarks for data-race. Nevertheless, this might be a useful set of benchmarks for us to look at because:

There are 425 of them.
They're from the Linux kernel (so should be sufficiently large).
Since both CPALockator and Goblint are thread-modular, there's hope that we can also analyze them successfully.

The same repository also seems to have some data-race benchmarks for Linux 4.18: https://gitlab.com/sosy-lab/software/ldv-benchmarks/-/tree/master/linux-4.18-races. And a handful of commit-based data-race benchmarks in Linux (although most don't have before vs after like the README claims): https://gitlab.com/sosy-lab/software/ldv-benchmarks/-/tree/main/ldv-commits-races.

Some of the benchmarks are in Unknowns subdirectories but still contain task definitions with expected verdicts (not sure based on what). These should probably be excluded to be on the safe side.


sim642 · 2022-03-02T09:31:22Z

In goblint/analyzer#618 I realized that these benchmarks have some problems and are not immediately usable.

LDV-specific model functions

A README says:

Thread creation is modeled by a function pthread_create with default signature, which creates a single thread, or pthread_create_N, which creates several instances at once.
Locking is organized by functions ldv_mutex_model_lock/ldv_mutex_model_unlock and ldv_spin_model_lock/ldv_spin_model_unlock.

The benchmarks don't define these LDV locking/unlocking functions themselves in terms of pthread, but rather assume that the analyzer itself supports them. Similarly, they use pthread_create_N (and a corresponding pthread_join_N), which are nonstandard and must be modeled as non-unique thread creations.
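For illustration, this is roughly what pthread-based definitions of the model functions could look like if the benchmarks shipped them. It's only a sketch under assumptions: the parameter lists, the ldv_lock_t stand-in type, the fixed instance count LDV_N_INSTANCES and the demo helper are all hypothetical and not taken from the benchmarks.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

/* ASSUMPTION: the benchmarks pass kernel lock objects (and possibly extra
   arguments); a plain pthread_mutex_t stands in for them here. */
typedef pthread_mutex_t ldv_lock_t;

void ldv_mutex_model_lock(ldv_lock_t *lock)
{
    int rc = pthread_mutex_lock(lock);
    assert(rc == 0);
    (void)rc;
}

void ldv_mutex_model_unlock(ldv_lock_t *lock)
{
    int rc = pthread_mutex_unlock(lock);
    assert(rc == 0);
    (void)rc;
}

/* Spinlocks are modeled the same way as mutexes in this sketch. */
void ldv_spin_model_lock(ldv_lock_t *lock)   { ldv_mutex_model_lock(lock); }
void ldv_spin_model_unlock(ldv_lock_t *lock) { ldv_mutex_model_unlock(lock); }

/* ASSUMPTION: "several instances" is modeled as a fixed, hypothetical count. */
#define LDV_N_INSTANCES 2

/* Hypothetical signature: `threads` points to LDV_N_INSTANCES handles. */
int pthread_create_N(pthread_t *threads, const pthread_attr_t *attr,
                     void *(*start)(void *), void *arg)
{
    for (int i = 0; i < LDV_N_INSTANCES; i++) {
        int rc = pthread_create(&threads[i], attr, start, arg);
        if (rc != 0)
            return rc;
    }
    return 0;
}

/* Hypothetical counterpart that joins every instance created above. */
int pthread_join_N(pthread_t *threads)
{
    for (int i = 0; i < LDV_N_INSTANCES; i++) {
        int rc = pthread_join(threads[i], NULL);
        if (rc != 0)
            return rc;
    }
    return 0;
}

/* Small demo: each instance increments a shared counter under the lock. */
static ldv_lock_t demo_lock = PTHREAD_MUTEX_INITIALIZER;
static int demo_counter;

static void *demo_worker(void *arg)
{
    (void)arg;
    for (int i = 0; i < 1000; i++) {
        ldv_mutex_model_lock(&demo_lock);
        demo_counter++;
        ldv_mutex_model_unlock(&demo_lock);
    }
    return NULL;
}

/* Returns the final counter value, or -1 if thread handling failed. */
int ldv_demo_count(void)
{
    pthread_t threads[LDV_N_INSTANCES];
    demo_counter = 0;
    if (pthread_create_N(threads, NULL, demo_worker, NULL) != 0)
        return -1;
    if (pthread_join_N(threads) != 0)
        return -1;
    return demo_counter;
}
```

Because every instance runs the same start routine with the same argument, an analyzer cannot treat the created thread as unique, which is the point of the non-unique modeling mentioned above.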

CPAchecker

The benchmarks are originally for CPALockator, which is based on CPAchecker, which has a related configuration file: https://github.com/sosy-lab/cpachecker/blob/trunk/config/includes/lockator/linux.properties.
As seen there, it adds the above-mentioned locking functions (and a few that weren't mentioned) for special lock handling.

Regarding the special thread creation, that is hard-coded into CPAchecker itself: https://github.com/sosy-lab/cpachecker/blob/2cfbe22b515fe9fedbd7629c62e9b93ddeda0a9a/src/org/sosy_lab/cpachecker/cfa/postprocessing/function/ThreadCreateTransformer.java#L68-L73.

Moreover, it defines a bunch of skippedvariables for other LDV-specific functions that also don't have definitions.

Klever

Somehow Klever has been used to generate these benchmarks from Linux source code. How it does the environment modeling, etc. doesn't seem to be described anywhere, but I managed to find this: https://github.com/ldv-klever/klever/blob/master/presets/jobs/specifications/linux/concurrency%20safety/synchronization%20primitives.aspect. That somehow defines the transformations that are applied to the original source code to replace the original function calls with LDV-specific ones.

Since Goblint has been used to analyze non-transformed Linux source code, it actually has a bunch of LibraryFunctions handling for the original names of the functions (such that locking, etc. is recognized), but not for the LDV-transformed names.
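To make the gap concrete, here is a sketch of the kind of name-to-role table an analyzer would need for the LDV-transformed names. It's written in C purely for illustration and is not Goblint's LibraryFunctions API; the enum, table and ldv_classify are hypothetical, and only the function names come from the README and the benchmarks.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical roles an analyzer would attach to special functions. */
enum ldv_role {
    LDV_UNKNOWN,
    LDV_LOCK,
    LDV_UNLOCK,
    LDV_CREATE_UNIQUE,  /* creates a single thread */
    LDV_CREATE_MULTI,   /* creates several instances: non-unique */
    LDV_JOIN_MULTI
};

struct ldv_entry {
    const char *name;
    enum ldv_role role;
};

/* Names as mentioned in the README and used by the benchmarks. */
static const struct ldv_entry ldv_table[] = {
    { "ldv_mutex_model_lock",   LDV_LOCK },
    { "ldv_mutex_model_unlock", LDV_UNLOCK },
    { "ldv_spin_model_lock",    LDV_LOCK },
    { "ldv_spin_model_unlock",  LDV_UNLOCK },
    { "pthread_create",         LDV_CREATE_UNIQUE },
    { "pthread_create_N",       LDV_CREATE_MULTI },
    { "pthread_join_N",         LDV_JOIN_MULTI },
};

/* Linear lookup; anything not in the table has no special handling. */
enum ldv_role ldv_classify(const char *name)
{
    assert(name != NULL);
    for (size_t i = 0; i < sizeof ldv_table / sizeof ldv_table[0]; i++) {
        if (strcmp(ldv_table[i].name, name) == 0)
            return ldv_table[i].role;
    }
    return LDV_UNKNOWN;
}
```

With only this table, an original kernel name such as mutex_lock classifies as LDV_UNKNOWN, which mirrors the situation in reverse: handling exists for one naming scheme but not the other.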

Incorrect Frama-C generation

All the benchmarks have the following header comment:

/* Generated by Frama-C */

I'm not sure what that exactly means, but my best guess is that the combining/merging mechanism from Frama-C has been used (a la cilly). Although it might also have involved something additional (simplifications?).

For example, linux-4.2.6-races/Unsafes/u__linux-concurrency_safety__drivers---net---irda---ksdazzle-sir.ko.cil.i contains in usb_endpoint_dir_in the following:

  __retres = (int)epd->bEndpointAddress < 0;

Crucially, bEndpointAddress is of an unsigned integer type (__u8 in Linux), so its value cast to int is never negative and __retres is always false, which in turn makes most of the code dead.
And that's not the only instance of such a useless expression.

Original Linux source

When looking for the same function in the source of that version of Linux, the code is completely different:

	return ((epd->bEndpointAddress & USB_ENDPOINT_DIR_MASK) == USB_DIR_IN);

These bit operations on an unsigned integer aren't trivially constant, so somewhere along the way the semantics have been changed.
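The difference between the two versions can be checked exhaustively over all 256 possible byte values. A minimal sketch, assuming bEndpointAddress is an 8-bit unsigned field and that USB_ENDPOINT_DIR_MASK and USB_DIR_IN are both 0x80 as in the kernel's USB headers; the struct and function names here are made up for the comparison:

```c
#include <assert.h>
#include <stdint.h>

/* ASSUMPTION: values as defined in the kernel's USB headers. */
#define USB_ENDPOINT_DIR_MASK 0x80
#define USB_DIR_IN            0x80

/* Hypothetical cut-down descriptor with only the relevant field. */
struct epd_sketch {
    uint8_t bEndpointAddress;
};

/* Expression as it appears in the generated benchmark. */
int dir_in_generated(const struct epd_sketch *epd)
{
    return (int)epd->bEndpointAddress < 0;
}

/* Expression as it appears in the original Linux source. */
int dir_in_original(const struct epd_sketch *epd)
{
    return ((epd->bEndpointAddress & USB_ENDPOINT_DIR_MASK) == USB_DIR_IN);
}

/* Counts the byte values for which the generated version is true. */
int count_generated_true(void)
{
    int count = 0;
    for (int a = 0; a <= 255; a++) {
        struct epd_sketch epd = { (uint8_t)a };
        count += dir_in_generated(&epd);
    }
    return count;
}

/* Counts the byte values on which the two versions disagree. */
int count_disagreements(void)
{
    int count = 0;
    for (int a = 0; a <= 255; a++) {
        struct epd_sketch epd = { (uint8_t)a };
        if (dir_in_generated(&epd) != dir_in_original(&epd))
            count++;
    }
    return count;
}
```

The generated version is never true, while the original is true for every address with the top bit set, so the two disagree on exactly half of the inputs (128 of 256).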

sim642 self-assigned this Jun 15, 2021

sim642 added the "new benchmark" label (New benchmark to analyze) Dec 15, 2021

sim642 mentioned this issue Mar 1, 2022: ldv-benchmarks races benchmarking goblint/analyzer#618 (Merged, 5 tasks)

sim642 added the "goblint" label (Goblint-specific problem) Mar 2, 2022

sim642 mentioned this issue Mar 2, 2022: Creating demonstrators for GobPie #22 (Open)

vesalvojdani mentioned this issue Mar 5, 2022: Original benchmarks from ldv-commits-races #23 (Open, 18 tasks)

This was referenced Apr 14, 2022:
Klever concurrency safety support goblint/analyzer#688 (Merged)
Add regenerated Klever Linux 5.5 concurrency safety benchmarks #27 (Draft)
