KernelStrider_Tutorial

Here is a step-by-step tutorial on how to build and install KernelStrider and use it with ThreadSanitizer to reveal data races in a Linux kernel module. The tutorial also shows how to use dynamic annotations to assist the race detector and eliminate false positives.

Note. This tutorial describes how to use the version of KernelStrider available in the repository. If you need a tutorial that corresponds to a given release of KernelStrider, doc/tutorial.txt in its sources.

Overview

KernelStrider instruments the kernel modules under analysis ("target modules"). When the code of the target modules is executed, KernelStrider collects information about the memory locations accessed by that code, about the functions called, etc. ThreadSanitizer analyzes these data ("the trace") in the user space and reveals data races in the target modules.

The detailed information about what ThreadSanitizer can do and how it works is here: http://code.google.com/p/data-race-test/

Currently, KernelStrider can be used on x86 systems only, both 32-bit and 64-bit systems are supported.

IMPORTANT
It is not recommended to use KernelStrider on the machines holding important data or providing important services.

Main components of KernelStrider operate in the kernel space. The system instruments the modules under analysis, collects information about how these modules operate and allows to change their behavior. This creates both a security hole and a potential for system instability, especially if the kernel modules under analysis are faulty.

There is no warranty. If you use KernelStrider, you do so at your own risk.

Prerequisites

GCC 4.0 or newer, at least C and C++ compilers are needed.
GNU make.
Linux kernel version 3.2 or newer with loadable module support and debugfs support enabled in the configuration (see also the notes below).
Everything necessary to build Linux kernel modules for the currently running kernel.
CMake 2.6 or newer, 2.8 or newer is recommended.
readelf and addr2line. These tools are usually present (they are from "binutils" package).
Development files for libelf library. Look for a package named "libelf-devel", "libelf-dev", "elfutils-libelf-devel", "libelfutils-devel", "lib64elfutils-devel" or something like that for your Linux distribution.
Development files for libdw library. They may be included in some of the elfutils development packages like such files for libelf or it may be provided in a separate package, libdw-dev(el), libdw1-dev(el) or the like.
libebl libraries (needed by libdw to determine the source lines for the instruction addresses). Usually, they are installed automatically because libdw depends on them but if this is not the case, they should be installed separately.
Python 2.7 or newer. It is needed, for example, for kedr.py helper script that manages the kernel-mode components of the system. It is possible to get away without it, although each component needs to be loaded manually in this case (see the instructions in this guide). Note that some distros may provide /usr/bin/python2.7, for example, but not /usr/bin/python. For convenience, it is recommended to create /usr/bin/python as a symlink to the former.

Notes

LZO compression is used by KernelStrider and should be enabled in the kernel (CONFIG_LZO_COMPRESS is set to "y" or "m").

If you would like to resolve the addresses in the report on the detected data races, that is, to find the matching lines of the source code of the module under analysis, CONFIG_DEBUG_INFO should also be set to "y" in the kernel configuration ("Kernel hacking" - "Compile the kernel with debug info").

It is also convenient (although not required for this particular tutorial) to have the following parameters set in the kernel configuration:

CONFIG_DEBUG_SPINLOCK=y ("Kernel hacking" - "Spinlock and rw-lock debugging: basic checks") - among other things, the operations with spinlocks are not inlined if this option is set to "y", which makes these operations detectable by our system.
CONFIG_FUNCTION_TRACER=y ("Kernel hacking" - "Tracers" - "Function tracer") - although KernelStrider does not use the function tracer (Ftrace) subsystem, this option has a useful side effect: it makes sure that each function of the target module is at least 5 bytes long (size of the relative near jump instruction) and is therefore instrumentable by KernelStrider core. Sometimes it can be necessary for all the functions of the target module to be "visible" to KernelStrider, even the short ones.

There are also kernel patches that help KernelStrider detect more synchronization constructs and therefore reduce the number of false positives it produces even further in some cases. The patches are available here.

If, for example, you build the kernel yourself rather than use a distro-packaged one, consider applying these patches too and setting CONFIG_KEDR_ANNOTATIONS=y.

This is not required for this particular tutorial but is often handy for the real-world tasks.

Building ThreadSanitizer

Get the source code of ThreadSanitizer here: http://code.google.com/p/data-race-test/source/checkout

Here we assume the source code is in "/home/user/temp/data-race-test".

To be able to analyze Linux kernel modules, we need to build the "offline" variant of ThreadSanitizer.

$ cd /home/user/temp/data-race-test/tsan
$ make VALGRIND_ROOT= GTEST_ROOT= PIN_ROOT= OFFLINE=1 l64d

"l64d" make target used above is for 64-bit Linux systems, For 32-bit systems, specify "l32d" instead.

ThreadSanitizer executable should now be in /home/user/temp/data-race-test/tsan/bin. On 64-bit x86 systems, it is "amd64-linux-debug-ts_offline", on 32-bit ones - "x86-linux-debug-ts_offline" or something like that.

It is convenient to make a symlink to that executable somewhere in $PATH. For example, (assuming /usr/local/bin/ is in $PATH):

# ln -s /home/user/temp/data-race-test/tsan/bin/amd64-linux-debug-ts_offline \
/usr/local/bin/ts_offline

Building and Installing KernelStrider

Get the latest release of KernelStrider here: https://github.com/euspectre/kernel-strider. Unpack the archive to a directory of your choice.

Alternatively, you can checkout the latest sources from the repository. The sources of KernelStrider are in "sources" subdirectory in this case.

KernelStrider only supports out-of-source builds, that is, it cannot be built in the directory where its sources lie. The source tree is not changed during the build.

Here we assume the top working directory is /home/user/temp. Create a build directory and configure KernelStrider:

$ mkdir build
$ cd build
$ cmake <path_to_KernelStrider_sources>

By default, KernelStrider is configured to be installed to /usr/local tree. If you would like to install it to another location, say, /home/user/apps, run CMake as follows instead:

$ cmake -DCMAKE_INSTALL_PREFIX=/home/user/apps \
<path_to_KernelStrider_sources>

In this tutorial, we assume KernelStrider will be installed to the default location.

Now build KernelStrider:

$ make

If you like, you can build and run KernelStrider's self-tests before installing (root privileges are needed):

# make check

Install KernelStrider. If you are installing it to the default location, root privileges may be needed.

# make install

Now KernelStrider should be installed and ready to use.

Building the Target Module

In this tutorial, a sample kernel module, "buggy01", provided with KernelStrider will be used. As its name implies, "buggy01" has concurrency problems that we will try to reveal here using KernelStrider and ThreadSanitizer.

The source code of the module was installed to /usr/local/share/kedr/examples/buggy01 when KernelStrider was installed. Now you can copy it and build the module:

$ cp -r /usr/local/share/kedr/examples/buggy01 /home/user/temp
$ cd /home/user/temp/buggy01
$ make

buggy01.ko will be built.

Building the Test Application

KernelStrider also provides a simple application ("test_buggy01") that forks and reads the file in debugfs maintained by "buggy01" module from both the child and the parent process.

The source code of the application is installed to /usr/local/share/kedr/examples/test_buggy01

You can copy the source code and build the application as follows:

$ cp -r /usr/local/share/kedr/examples/test_buggy01 /home/user/temp
$ cd /home/user/temp/test_buggy01
$ gcc -Wall -o test_buggy01 test_buggy01.c

Analyzing the Target Module

Note that the operations with the kernel-space parts of KernelStrider require root privileges.

1. The Kernel-mode Components

First, load the kernel-mode components of KernelStrider:

# /usr/local/bin/kedr.py start --tools=KernelStrider --targets=buggy01

Actually, "--targets" accepts a comma-separated list of kernel modules to be analyzed.

2. Debugfs

In this tutorial, we assume that debugfs is mounted to /sys/kernel/debug. If it is not, you can mount it:

# mount -t debugfs none /sys/kernel/debug

3. Collecting Data

3.1

The collected data are transferred by KernelStrider from the kernel to the user space through a memory buffer and then are stored in a file. If the data stream is intense and the user-space part of KernelStrider cannot keep up, the information about the newer events in the target module will be lost making the collected data unusable. You can view the current count of lost events in "/sys/kernel/debug/kedr_simple_trace_recorder/events_lost".

To avoid problems due to lost events, you have the following options.

Instruct KernelStrider to use a larger memory buffer for output. Use "--nr_data_pages" option of kedr.py to set the size of the buffer (the number of memory pages) when loading the tool. The value must be a power of 2 and it must be less than or equal to 65536. Default value: 128.
Enable sampling to reduce the intensity of the data stream. See this guide for details.

Both approaches can be used at the same time.

3.2

Load the user-space part of the system and specify the path to the file where the collected data should be saved (requires root privileges):

# /usr/local/bin/kedr_st_recorder trace.dat &

If "trace.dat" file already exists, it will be cleared automatically.

3.3

Now it is time to load the target module.

# insmod /home/user/temp/buggy01/buggy01.ko

3.4

To be able to analyze the target's code, you need to make that code execute somehow. One could operate on the device that the module services, run some specific tests on it, etc.

The target module used in this tutorial maintains a read-only file in debugfs. You can use a simple application ("test_buggy01") provided with KernelStrider to read that file from two processes. Alternatively, you may just 'cat' that file a couple of times.

# /home/user/temp/test_buggy01/test_buggy01

When you are done with the target module, unload it:

# rmmod buggy01

3.5

The user-space part of the output system ("kedr_st_recorder" process) should stop automatically when the target is unloaded. If it fails to stop for some reason, send SIGINT or SIGTERM to it ('kill -INT <PID>' or 'kill <PID>').

Check if the output system has been able to save information for all events:

# cat /sys/kernel/debug/kedr_simple_trace_recorder/events_lost

If "events_lost" contains a non-zero value, you would probably want to stop KernelStrider and then retry from the first step. This is because the saved data could be unreliable in this case (especially if some "lock"/"unlock" events have been lost).

3.6

If you would like to check some other operations with the target module and record the corresponding trace, you can start from step 3.2 (probably with a different file to save the trace to) and go on as described above.

3.7

When you have collected all the traces you wanted to, the kernel-mode components of KernelStrider can be unloaded.

# /usr/local/bin/kedr.py stop

3.8

To analyze the collected data, "tsan_process_trace" tool can be used. It automatically converts the trace to the format ThreadSanitizer understands, feeds the trace to the latter and outputs the report. The tool looks for ThreadSanitizer (ts_offline, etc.) in $PATH by default.

$ /usr/local/bin/tsan_process_trace /home/user/temp/buggy01/buggy01.ko \
< trace.dat > tsan_report.txt

tsan_report.txt will now contain the report of ThreadSanitizer. It is described in the section 4 how to interpret it.

The tool uses the binary file of the target module to determine the addresses of ELF sections and (if debug info is available in the module) the appropriate source lines.

To get more information about the abilities of tsan_process_trace tool, execute

$ /usr/local/bin/tsan_process_trace --help

4. Analyzing the Trace

4.1

Before you begin to analyze the report produced by ThreadSanitizer, there is a couple of important points to consider.

First, the number and the contents of the warnings about potential data races may vary. It depends on the actual interleaving of the threads that execute the code of the target modules.

Second, not all warnings describe real races. That is, false positives are possible.

When ThreadSanitizer is used in "pure happens-before" mode (default), a common source of false positives is that not all ordering relationships between the different parts of the target module are known to KernelStrider yet. For example, if the target registers some callback function with the kernel proper or some other module, usually that callback cannot be called before it is registered. If KernelStrider does not know about this particular relationship, this may result in false positives. In addition, some synchronization primitives used in the kernel are inlined and difficult to detect at the level of binary code.

When ThreadSanitizer is used in "hybrid" mode, the amount of false positives may grow further but the tool may find more real races too, see http://code.google.com/p/data-race-test/wiki/PureHappensBeforeVsHybrid

You can help KernelStrider recognize synchronization and other ordering relationships in the target module by placing special annotations in it.

We are working on KernelStrider for it to be able to detect ordering relationships in some widely used kinds of kernel modules (e.g. networking). This is a work in progress and even with that, annotations are still needed sometimes.

We will describe below how to use dynamic annotations in the code of the target module to reduce the number of the reported false positives.

4.2

Let us look at the report (tsan_report.txt). Here is one of the warnings you are likely to encounter there:

WARNING: Possible data race during read of size 4 at 0xffff88004b3dbda8:
T2 (L{}):
#0  sample_open (module.c:43):
Concurrent write(s) happened at (OR AFTER) these points:
T1 (L{}):
#0  sample_init_module (module.c:122):
Location 0xffff88004b3dbda8 is 8 bytes inside a block
starting at 0xffff88004b3dbda0 of size 16 allocated by T1 from heap:
#0  kmalloc (slab_def.h:161)
#0  kzalloc (slab.h:340)
#1  sample_init_module (module.c:122)

Note that the same index of the stack items (#0 for kmalloc and kzalloc above) is used for the "calls" ("substitutions") of inline functions.

ThreadSanitizer suspects there is a data race between threads T2 and T1. You can find the list of threads at the end of the report.

T2 reads 4 bytes starting from address 0xffff88004b3dbda8, this happens in sample_open() function in a statement that begins at the line 43 of "module.c" file. Here is the fragment of the source code:

43  pr_info(TEST_MSG_PREFIX "Opening, count is %d.\n",
44      some_data->count);

This is the reading of "some_data->count".

Another candidate would be reading of the value of "some_data" itself (it is a global variable). However this is a 64-bit system and "some_data" is 8 bytes in size rather than 4 mentioned in the report.

"L{}" means that T2 did not hold any locks at that moment.

Thread T1 writes 4 bytes to the same memory area in sample_init_module() function, in the statement at module.c:122 or a bit later. Note that ThreadSanitizer reports the exact location in the code for the first of the conflicting accesses but may be less accurate with the remaining ones. For these, it reports where the corresponding block of code starts ("block" means "a region of the binary code without function calls, backward jumps and some other constructs" in this case).

122 some_data = kzalloc(sizeof(struct some_data), GFP_KERNEL);
123 if (some_data == NULL) {
124     err = -ENOMEM;
125     goto fail0;
126 }
127 some_data->count = 0; //< A possibly conflicting access to "count"
128
129 dir_dentry = debugfs_create_dir("buggy01", NULL);

Here you can see that the conflicting access to "some_data->count" is actually at line 127.

The report also states, the data that were accessed belong to a structure of size 16 and are located there at the offset of 8 bytes from the beginning. The call stack of the allocation of that structure is provided. This way we can confirm that the possibly conflicting operations were accessing "some_data->count": "*some_data" was allocated at line 122 and "count" is indeed at the offset of 8 bytes from the beginning of that structure.

You can find more information about ThreadSanitizer reports here: http://code.google.com/p/data-race-test/wiki/UnderstandingThreadSanitizerReports

So, is that a data race? Is that possible for these operations to execute concurrently?

In this case, the answer is "most likely no". The read access mentioned in this report happens in sample_open(), a file operation provided by "buggy01" to maintain its file in debugfs. But "some_data->count" is written to in the init function of the target module before that file is created by debugfs_create_file() at line 136.

sample_open() cannot be called for a file before debugfs_create_file() creates that file. So these two accesses to "some_data->count" are extremely unlikely to happen concurrently, if that is possible at all.

Indeed, there are at least the following facts about the file operation callbacks (sample_open(), sample_read() and sample_release() in this example) for the files in debugfs that KernelStrider is unaware of by default:

no file operation callback can start before debugfs_create_file() starts for the corresponding file;
no file operation callback can be completed later than debugfs_remove() called for the corresponding file returns;
no file operation callback can be completed later than the exit function of the target module starts.

These are the examples of a so called "happens-before" (or "happened-before") relation, a partial order on the events observed by KernelStrider in the target module.

The detailed information about this relation can be found here:

http://en.wikipedia.org/wiki/Happened-before

http://code.google.com/p/data-race-test/wiki/ThreadSanitizerAlgorithm

4.3

So, one needs to tell KernelStrider somehow that if the start of the execution of debugfs_create_file() is observed before the start of execution of sample_open(), the former actually happens before the latter.

To do this, you can "annotate" the relevant locations in the code with the special macros KernelStrider provides, rebuild "buggy01", repeat all the operations with it, collect a new trace and feed it to ThreadSanitizer again as described above, etc.

The annotated variant of "buggy01" is in "/usr/local/share/kedr/examples/buggy01_annotated".

The annotations are "dynamic" in a sense that they are not comments, that is, they are not ignored by the compiler but rather expand to the appropriate code fragments and are processed by KernelStrider when the target module is loaded.

If your kernel is patched to enable the annotations (and CONFIG_KEDR_ANNOTATIONS is 'y') then no changes are needed in the "Kbuild" file of the target module. Otherwise, you need "kedr_annotations.c" and and "kedr_annotations.h" provided by KernelStrider (they are installed to "/usr/local/share/kedr/annotations/"). Copy these files to the directory where the source code of the target module is. #define CONFIG_KEDR_ANNOTATIONS to a non-zero value in the compiler options, usually - in "ccflags-y" in "Kbuild" file of the target module. You also need to instruct the build system of the target module to compile "kedr_annotations.c" and link the resulting object file into the target module.

Here is "Kbuild" file for "buggy01_annotated":

MODULE_NAME := buggy01

ccflags-y := -DCONFIG_KEDR_ANNOTATIONS=1 -I$(src)
obj-m := $(MODULE_NAME).o
$(MODULE_NAME)-y := module.o kedr_annotations.o

Note that a target module containing the annotations described here can be used normally even if KernelStrider is not loaded.

4.4

Add

    #include <linux/kedr_annotations.h>

or, if the support for annotations is not provided by the kernel,

    #include "kedr_annotations.h"

to the source files of the target module where you would like to use the annotations.

You also need to choose the identifiers for happens-before "links" ("arcs") you are going to specify in the code with the annotations. An ID is a non-zero unsigned long value. Sometimes it is convenient to use the address of some relevant object as an ID but in this example, the plain numbers are used, see "enum id_happens_before" in buggy01_annotated/module.c.

KEDR_ANNOTATE_HAPPENS_BEFORE(id) and KEDR_ANNOTATE_HAPPENS_AFTER(id) define a happens-before arc if "id" is the same.

Here is how the code of sample_open() can be annotated:

static int
sample_open(struct inode *inode, struct file *filp)
{
    int ret;
    /* The start of execution of sample_open() happens after
     * debugfs_create_file() starts for the corresponding file.
     * Note that this does *not* mean that sample_open() may start
     * right after debugfs_create_file() has started though. */
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_CREATE_HB_OPEN);

    pr_info(TEST_MSG_PREFIX "Opening, count is %d.\n",
        some_data->count);

    mutex_lock(&some_lock);
    ++some_data->count;
    snprintf(some_data->buf, strlen(some_string) + max_len,
        "#%d: %s\n", some_data->count, some_string);
    filp->private_data = some_data->buf;
    mutex_unlock(&some_lock);

    ret = nonseekable_open(inode, filp);

    /* The end of execution of sample_open() happens before
     * debugfs_remove() starts for the corresponding file.
     * It also happens before the exit function of this module
     * starts. */
    KEDR_ANNOTATE_HAPPENS_BEFORE(ID_OPEN_HB_REMOVE);
    KEDR_ANNOTATE_HAPPENS_BEFORE(ID_OPEN_HB_EXIT);
    return ret;
}

sample_read() and sample_release() are annotated in a similar way.

The creation of the file in debugfs can be annotated as follows in sample_init_module():

    KEDR_ANNOTATE_HAPPENS_BEFORE(ID_CREATE_HB_OPEN);
    KEDR_ANNOTATE_HAPPENS_BEFORE(ID_CREATE_HB_READ);
    KEDR_ANNOTATE_HAPPENS_BEFORE(ID_CREATE_HB_RELEASE);
    file_dentry = debugfs_create_file("data", S_IRUGO, dir_dentry, NULL,
        &sample_fops);

Note that KEDR_ANNOTATE_HAPPENS_BEFORE() is usually placed right before the operation to be annotated, KEDR_ANNOTATE_HAPPENS_AFTER() - right after the operation.

Here is the annotated exit function of "buggy01" module:

static void __exit
sample_exit_module(void)
{
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_OPEN_HB_EXIT);
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_READ_HB_EXIT);
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_RELEASE_HB_EXIT);

    pr_info(TEST_MSG_PREFIX "Opened %d time(s).\n",
        some_data->count);

    debugfs_remove(file_dentry);
    /* The following 3 annotations are redundant because each callback
     * finishes before the exit functions of the module starts. The
     * annotations with ids 'ID_*_HB_EXIT' let KernelStrider know about
     * that. Still, leaving the annotations here makes no harm. */
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_OPEN_HB_REMOVE);
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_READ_HB_REMOVE);
    KEDR_ANNOTATE_HAPPENS_AFTER(ID_RELEASE_HB_REMOVE);

    debugfs_remove(dir_dentry);
    kfree(some_data->buf);
    kfree(some_data);
    return;
}

4.5

There is one more thing that you might need to tell KernelStrider about. Consider "struct file" object pointed to by 'filp' in the file operation callbacks. The memory for this object is allocated by the kernel before sample_open() is called and deallocated after sample_release() is called. KernelStrider is currently not aware of this.

Suppose after a thread opened, accessed and closed the file, the memory occupied by "struct file" object has been deallocated. Suppose the target module or some other kernel-mode component allocates a memory block of a suitable size after that. The chances are, it will be given the very block previously occupied by that "struct file" object. As a consequence, the accesses to these unrelated objects could be considered conflicting which would lead to more false positives.

Another kind of annotations can be used to tell KernelStrider that memory occupied by the "struct file" object appeared at the beginning of sample_open() and disappeared at the end of sample_release(). This is not quite accurate but it is enough in this case. Here they are:

KEDR_ANNOTATE_MEMORY_ACQUIRED(addr, size)
KEDR_ANNOTATE_MEMORY_RELEASED(addr)

You can see in sample_open() and sample_release() (buggy01_annotated/module.c) how these annotations are used.

4.6

Now that you have the annotated variant of "buggy01" module, you can build it and perform the steps described earlier in this tutorial once again to get a report listing possible data races.

Note that when using "tsan_process_trace" or "tsan_report_addr2line" tools, the correct ".ko" file with debug information should be specified. This seems obvious but still it is easy to forget and use the old "buggy.ko" there (especially if you have several variants of the target module around at the same time). As a consequence, the source lines could be incorrect in the report.

Also note that ThreadSanitizer may report not all races between accesses to a given memory area. As a result, a real race may remain missing from the report if there is a false positive involving the same memory location. When you specify the relevant happens-before arcs and object lifetimes (KEDR_ANNOTATE_MEMORY_ACQUIRED/RELEASED), the false positives go away and the real races they have been hiding are finally reported.

The report you have obtained this time is likely to contain the following 3 warnings:

[W1]
WARNING: Possible data race during read of size 8 at 0xffff88003795ab80:
T2 (L{L1}):
#0  sample_open (module.c:81):
Concurrent write(s) happened at (OR AFTER) these points:
T1 (L{}):
#0  sample_init_module (module.c:203):
Location 0xffff88003795ab80 is 0 bytes inside a block
starting at 0xffff88003795ab80 of size 16 allocated by T1 from heap:
#0  kmalloc (slab_def.h:161)
#0  kzalloc (slab.h:340)
#1  sample_init_module (module.c:174)
Locks involved in this report (reporting last lock sites): {L1}
L1 (0xffffffffa01500e0)
#0  sample_open (module.c:79):

[W2]
WARNING: Possible data race during write of size 1 at 0xffff8800347f6700:
T2 (L{L1}):
#0  sample_open (module.c:81):
Concurrent write(s) happened at (OR AFTER) these points:
T1 (L{}):
#0  sample_init_module (module.c:203):
Location 0xffff8800347f6700 is 0 bytes inside a block
starting at 0xffff8800347f6700 of size 38 allocated by T1 from heap:
#0  kmalloc (slab_def.h:161)
#0  kzalloc (slab.h:340)
#1  sample_init_module (module.c:203):
Locks involved in this report (reporting last lock sites): {L1}
L1 (0xffffffffa01500e0)
#0  sample_open (module.c:79):

[W3]
WARNING: Possible data race during read of size 4 at 0xffff88003795ab88:
T3 (L{}):
#0  sample_open (module.c:76):
Concurrent write(s) happened at (OR AFTER) these points:
T2 (L{L1}):
#0  sample_open (module.c:80):
Location 0xffff88003795ab88 is 8 bytes inside a block
starting at 0xffff88003795ab80 of size 16 allocated by T1 from heap:
#0  kmalloc (slab_def.h:161)
#0  kzalloc (slab.h:340)
#1  sample_init_module (module.c:174)
Locks involved in this report (reporting last lock sites): {L1}
L1 (0xffffffffa01500e0)
#0  sample_read (module.c:127):

Let us analyze these warnings one by one.

4.6.1

Using the information about the accessed memory location as a hint, we can assume the warning W1 is about a read from and a write to "some_data->buf" (the first field, i.e. the field at the offset 0, of "struct some_data" allocated at module.c:174).

The memory is read in sample_open() at module.c:81 with lock L1 held ("L{L1}"):

79  mutex_lock(&some_lock);
80  ++some_data->count;
81  snprintf(some_data->buf, strlen(some_string) + max_len,
82      "#%d: %s\n", some_data->count, some_string);

Note that this is about reading the value of "some_data->buf" itself rather than about accessing the data it points to.

According to the report, the conflicting write happened in sample_init_module() at or after line 203 of "module.c":

203 some_data->buf = kzalloc(strlen(some_string) + max_len,
        GFP_KERNEL);

As you can see, "some_data->buf" is indeed assigned a value at line 203.

Is this actually a data race?

When looking through the code (buggy01_annotated/module.c), you have probably noticed that "some_data->buf" is assigned a value in sample_init_module() after the file in debugfs has been created by debugfs_create_file() at line 194. The read from "some_data->buf" happens when someone opens that file.

As it is common in the kernel, as soon as some facility is made available, some part of the system may try to access it. In this case, as soon as the file in debugfs has been created, someone may try to open it and read data from it, even if the sample_init_module() has not finished yet.

It is possible that someone tries to open "/sys/kernel/debug/buggy01/data" after debugfs_create_file() has created it but before "some_data->buf" is assigned a meaningful value at line 203. If that happens, a kernel oops is likely to happen. The "time window" is very narrow but still.

So, yes, this is a race. As it is often the case for the races between initialization and usage of a resource, this one may have serious consequences.

4.6.2

The lines of the source code for the conflicting accesses in the warning W2 are the same as in W1 but a different object is accessed there. Indeed, consider information about the memory location and take into account that the size of the area accessed concurrently is 1 byte in this case rather than 8 bytes in W1.

The accessed memory location is "0 bytes inside a block <...> of size 38 allocated by T1 from heap" in sample_init_module() at module.c:203. The buffer pointed to by "some_data->buf" is allocated there and the memory location of interest is the first byte of that buffer.

The first conflicting write is in sample_open() at module.c:81:

79  mutex_lock(&some_lock);
80  ++some_data->count;
81  snprintf(some_data->buf, strlen(some_string) + max_len,
82      "#%d: %s\n", some_data->count, some_string);

This time, a write to the buffer by snprintf() function is considered.

A conflicting write happens in sample_init_module() at or after module.c:203:

203 some_data->buf = kzalloc(strlen(some_string) + max_len,
204     GFP_KERNEL);
205 if (some_data->buf == NULL) {
206     err = -ENOMEM;
207     goto fail2;
208 }
209 some_data->buf[0] = 0;

As you can see from this code fragment, the first byte of the buffer is actually accessed at line 209.

This is also a real data race for the same reason as the race reported in the warning W1. They both are the result of a fact that the memory buffer is initialized after it might start being used.

4.6.3

The warning W3 concerns the accesses to "some_data->count" (the field at the offset 8 of the structure "struct some_data" allocated at module.c:174).

The read happens at module.c:76 without any locks held, the conflicting write happens at or after module.c:80 with lock L1 held. Both accesses are in sample_open() called from two different threads. Here is the code fragment:

76  pr_info(TEST_MSG_PREFIX "Opening, count is %d.\n",
77      some_data->count);
78
79  mutex_lock(&some_lock);
80  ++some_data->count;

The conflicting accesses are the reading of "some_data->count" as an argument for pr_info() and the increment at line 80.

sample_open() can be executed concurrently by several threads for different "struct file" objects. "some_data" structure is shared between the threads, so this is a real data race too. To avoid it, pr_info() should be moved after "mutex_lock(&some_lock)" at line 79.

Congratulations! You have found 3 real data races in "buggy01" module with the help of KernelStrider and ThreadSanitizer during this tutorial.

KEDR

KernelStrider

RaceHound

KernelStrider_Tutorial

Overview

Prerequisites

Notes

Building ThreadSanitizer

Building and Installing KernelStrider

Building the Target Module

Building the Test Application

Analyzing the Target Module

1. The Kernel-mode Components

2. Debugfs

3. Collecting Data

3.1

3.2

3.3

3.4

3.5

3.6

3.7

3.8

4. Analyzing the Trace

4.1

4.2

4.3

4.4

4.5

4.6

4.6.1

4.6.2

4.6.3

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally