RFC-0015: Better Association of Perf Samples with Counters #4545

@LalitMaganti · 2026-01-23T17:27:23Z

github-actions[bot]
Bot Jan 23, 2026

📄 RFC Doc: 0015-perf-counters-refactor.md

Better Association of Perf Samples with Counters

Status: Decided

PR: N/A

Problem

When we collect perf samples today, we always collect them based on a timebase,
which answers the question "which hardware/software counter should this sample
be collected on?". In most cases this is a time- or cycle-based counter,
configured as something like "sample every 1 ms" or "sample every 1 mcycle".

At the same time, the perf subsystem can also look up a number of other counters
and record their values. These are "follower counters", which allow tracking
other metrics such as page faults.

The main problem is that, when this data reaches trace processor today, the link
between the sample (mainly the callstack) and the counters is lost. That means
that answering the question "what was the value of the perf counter at
sample X?" is very hard. In practice it requires joining across the tables on
(perf_sample_id, ts), which is both highly inefficient and non-intuitive.

Design

We propose making the following changes:

1. Introduce a new __intrinsic_perf_counter_set table with the columns
   perf_counter_set_id and counter_id. perf_counter_set_id is a "set id"
   similar to arg_set_id; counter_id points to the counter value in the
   counter table.
2. Introduce a new intrinsic function, __intrinsic_perf_counter_for_sample,
   which, given a perf sample id and a counter name, returns the value of that
   counter for that sample.
3. Introduce a new stdlib view, perf_sample_with_counters, in the linux.perf
   module, which is the join of perf_sample, __intrinsic_perf_counter_set and
   the counter table. This is a fully denormalized view which people can use
   for filtering/aggregation, at the cost of the performance overhead of the
   joins.
4. Re-export __intrinsic_perf_counter_for_sample in the stdlib under the
   public name linux_perf_counter_for_sample(sample_id).
5. Add a new column to __intrinsic_perf_sample (not exposed in the public API
   for now) called counter_set_id, which maps to __intrinsic_perf_counter_set.
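
To make the shape of the proposal concrete, here is a minimal sketch that models it in plain SQLite from Python. It is not trace processor code. The column names perf_counter_set_id, counter_id and counter_set_id come from the list above; everything else (the simplified counter table with an inline name column, the perf_sample stand-in for __intrinsic_perf_sample, the sample data, and the Python helper counter_for_sample) is an assumption made for illustration.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
-- Simplified stand-in for the counter table (assumption: name stored inline).
CREATE TABLE counter (id INTEGER PRIMARY KEY, name TEXT, ts INTEGER, value REAL);

-- Proposed set table: one row per (set, counter) pair, like arg_set_id.
CREATE TABLE __intrinsic_perf_counter_set (
  perf_counter_set_id INTEGER,
  counter_id INTEGER
);

-- Stand-in for __intrinsic_perf_sample with the proposed counter_set_id column.
CREATE TABLE perf_sample (
  id INTEGER PRIMARY KEY,
  ts INTEGER,
  callsite_id INTEGER,
  counter_set_id INTEGER
);

-- Hypothetical data: two samples, each with a timebase and a follower counter.
INSERT INTO counter VALUES
  (0, 'cpu-clock', 1000, 1.0), (1, 'page-faults', 1000, 7.0),
  (2, 'cpu-clock', 2000, 2.0), (3, 'page-faults', 2000, 9.0);
INSERT INTO __intrinsic_perf_counter_set VALUES (0, 0), (0, 1), (1, 2), (1, 3);
INSERT INTO perf_sample VALUES (10, 1000, 100, 0), (11, 2000, 101, 1);

-- Denormalized view: one row per (sample, counter).
CREATE VIEW perf_sample_with_counters AS
SELECT s.id AS sample_id, s.ts, s.callsite_id,
       c.name AS counter_name, c.value AS counter_value
FROM perf_sample s
JOIN __intrinsic_perf_counter_set cs
  ON cs.perf_counter_set_id = s.counter_set_id
JOIN counter c ON c.id = cs.counter_id;
""")


def counter_for_sample(sample_id, counter_name):
    """Return the value of the named counter for one sample, or None."""
    row = db.execute(
        "SELECT counter_value FROM perf_sample_with_counters "
        "WHERE sample_id = ? AND counter_name = ?",
        (sample_id, counter_name),
    ).fetchone()
    return None if row is None else row[0]


print(counter_for_sample(10, "page-faults"))
```

In this toy model the helper plays the role of the proposed lookup function: the question "what was the value of the perf counter at sample X" becomes a single keyed lookup rather than a join on timestamps.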

Alternatives considered

Just having the normalized tables

It can be argued that the denormalized perf_sample_with_counters view is a
footgun: it makes it easy for people to write queries which are inefficient or
which return data users don't expect (because each sample has multiple rows,
one per counter).

After discussion within the team, it was decided that the benefit of having a
single, join-free table to query outweighed these considerations.
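
The row-multiplication concern can be illustrated with a small sketch, again using plain SQLite from Python rather than trace processor. The table below is a hand-written stand-in for the rows the denormalized view would produce; its column names and data are assumptions made for illustration.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.executescript("""
-- Stand-in for the denormalized view: one row per (sample, counter).
CREATE TABLE perf_sample_with_counters (
  sample_id INTEGER,
  counter_name TEXT,
  counter_value REAL
);
INSERT INTO perf_sample_with_counters VALUES
  (10, 'cpu-clock', 1.0), (10, 'page-faults', 7.0),
  (11, 'cpu-clock', 2.0), (11, 'page-faults', 9.0);
""")


def scalar(sql):
    """Run a query that returns a single value and return that value."""
    return db.execute(sql).fetchone()[0]


# Naive count: counts rows, so every sample is counted once per counter.
naive = scalar("SELECT COUNT(*) FROM perf_sample_with_counters")

# Two ways to count samples correctly on the denormalized shape.
distinct = scalar("SELECT COUNT(DISTINCT sample_id) FROM perf_sample_with_counters")
filtered = scalar(
    "SELECT COUNT(*) FROM perf_sample_with_counters "
    "WHERE counter_name = 'cpu-clock'"
)

print(naive, distinct, filtered)
```

With two samples and two counters per sample, the naive count sees four rows while there are only two samples. Any aggregation over the view needs to either filter to a single counter or deduplicate on the sample id.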

Open questions

N/A

💬 Discussion Guidelines:

This discussion is automatically synced with the RFC document
Please provide constructive feedback and suggestions

2026-01-23T17:29:17Z

github-actions[bot]
Bot Jan 23, 2026
Author

📝 RFC Document Updated

View changes: Commit History
