Implement custom OpenTelemetry sampler to filter uninstrumented traces #8647

jimmygchen · 2026-01-12T05:11:20Z

Description

Adds a PrefixBasedSampler to filter out traces that don't originate from known instrumented code paths. This reduces noise from uninstrumented code paths when exporting to OTLP backends.

Root spans use the lh_ prefix to identify Lighthouse instrumented entry points. The sampler is used with OpenTelemetry's ParentBased sampler:

Root spans are sampled only if their name starts with lh_
Child spans automatically inherit their parent's sampling decision

This enables effective trace sampling for #8554 - without filtering, we get spans from uninstrumented code paths (e.g. fork_choice_write_lock only traces), making low sample rates ineffective at capturing meaningful instrumented traces.

Additional Info

The lh_ prefix approach eliminates the need to maintain an allowlist of span names - new instrumented spans just need the prefix to be exported. The prefix is kept short to minimize storage overhead in tracing backends.

Add AllowedRootSpanSampler to filter out traces that don't originate from known instrumented code paths. This reduces noise from library code and uninstrumented paths when exporting to OTLP backends. Uses the idiomatic OpenTelemetry ParentBased sampler pattern: - Root spans are sampled only if their name is in LH_BN_ROOT_SPAN_NAMES - Child spans automatically inherit their parent's sampling decision - Efficient head-based sampling with no per-span tracking overhead This enables effective trace sampling in production - without filtering, the majority of traces would be noise, making low sample rates ineffective at capturing meaningful instrumented code paths.

eserilev

just a few small things on my end

eserilev · 2026-01-12T23:18:42Z

beacon_node/lighthouse_tracing/src/lib.rs

+        _attributes: &[opentelemetry::KeyValue],
+        _links: &[Link],
+    ) -> SamplingResult {
+        if self.allowed_names.contains(&name) {


A small optimization could be to make allowed_names a set instead of a list. Right now the list of allowed spans is relatively small so it might not matter much until the list gets bigger

I just had a thought: perhaps we could do prefix instead, so we don't have to maintain this list at all, because longer term this could be quite bad devex - say I add a trace, but forgot to add to this new trace to the allowed list, then i build and deploy BUT couldn't find the trace and had to debug - this wastes some dev cycles.

I'm thinking to add a lh_ prefix (keeping it short so that it doesn't take up a lot of backend storage), what do you think?

I've added this - i think this is a better longer term solution than maintaining a list. Let me know your thoughts!

I agree, this seems much better

Looking much better without the noise \o/

eserilev · 2026-01-12T23:19:11Z

beacon_node/lighthouse_tracing/src/lib.rs

+pub struct AllowedRootSpanSampler {
+    allowed_names: &'static [&'static str],
+}


Maybe a few unit tests here could be nice?

Yes, added!

Replace the allowlist-based AllowedRootSpanSampler with a generic `PrefixBasedSampler` that filters spans by prefix. Root spans now use the `lh_` prefix to identify Lighthouse instrumented entry points. Changes: - Rename `lighthouse_tracing` to `tracing_samplers` and move to `common/` - Replace `AllowedRootSpanSampler` with `PrefixBasedSampler` - Remove all `SPAN_*` constants, use inline strings at call sites - Remove `LH_BN_ROOT_SPAN_NAMES` allowlist This eliminates the need to maintain an allowlist of span names. New instrumented spans just need the `lh_` prefix to be exported. The sampler is now generic and can be reused by validator_client.

eserilev

LGTM!

One more thought I had was to maybe add a custom proc macro so that we don't have to do

#[instrument(name = "lh_produce_unaggregated_attestation", skip_all, fields(%request_slot, %request_index), level = "debug")]
fn produce_unaggregated_attestation()

and instead do something like this

#[lh_instrument(skip_all, fields(%request_slot, %request_index), level = "debug")]
fn produce_unaggregated_attestation()

where the proc macro automatically appends lh_ to the function name

jimmygchen · 2026-01-14T04:07:53Z

LGTM!

One more thought I had was to maybe add a custom proc macro so that we don't have to do
#[instrument(name = "lh_produce_unaggregated_attestation", skip_all, fields(%request_slot, %request_index), level = "debug")]
fn produce_unaggregated_attestation()
and instead do something like this
#[lh_instrument(skip_all, fields(%request_slot, %request_index), level = "debug")]
fn produce_unaggregated_attestation()
where the proc macro automatically appends lh_ to the function name

Thanks, I see the convenience with the macro, but IMO the gain is quite trivial to justify adding custom macro for this - it may not be immediately obvious what this macro does without looking at the implementation and might add confusion vs when to use #[instrument] macro - which is still what we will use most of the time when creating spans. I have a slight preference to be explicit here for readability, but I'm open if you and others think it's useful to add.

eserilev · 2026-01-14T22:22:00Z

sorry thought I already left a comment. yes I agree with you, the proc macro on second thought seems pretty useless. LGTM!

mergify · 2026-01-22T04:32:40Z

Merge Queue Status

✅ The pull request has been merged at d0c0324

This pull request spent 39 minutes 25 seconds in the queue, including 38 minutes 9 seconds running CI.
The checks were run on draft #8693.

Required conditions to merge

check-success=local-testnet-success
check-success=test-suite-success

jimmygchen requested a review from eserilev January 12, 2026 05:11

jimmygchen added ready-for-review The code is ready for review tracing labels Jan 12, 2026

eserilev reviewed Jan 12, 2026

View reviewed changes

jimmygchen added 2 commits January 13, 2026 16:26

Add unit test for AllowedRootSpanSampler.

3e67608

jimmygchen requested a review from jxs as a code owner January 13, 2026 06:39

Fix cargo sort and fmt

d0c0324

jimmygchen requested a review from eserilev January 13, 2026 07:06

eserilev approved these changes Jan 13, 2026

View reviewed changes

eserilev mentioned this pull request Jan 13, 2026

feat: adding telemetry-trace-sample-rate cli arg and set default to 1% #8561

Open

eserilev approved these changes Jan 14, 2026

View reviewed changes

jimmygchen added ready-for-merge This PR is ready to merge. and removed ready-for-review The code is ready for review labels Jan 22, 2026

mergify bot added the queued label Jan 22, 2026

mergify bot added a commit that referenced this pull request Jan 22, 2026

Merge of #8647

c824400

mergify bot mentioned this pull request Jan 22, 2026

merge queue: embarking unstable (21cabba) and #8647 together #8693

Closed

6 tasks

mergify bot merged commit 7f06500 into sigp:unstable Jan 22, 2026
36 checks passed

mergify bot removed the queued label Jan 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement custom OpenTelemetry sampler to filter uninstrumented traces #8647

Implement custom OpenTelemetry sampler to filter uninstrumented traces #8647

jimmygchen commented Jan 12, 2026 •

edited

Loading

Uh oh!

eserilev left a comment

Uh oh!

eserilev Jan 12, 2026

Uh oh!

jimmygchen Jan 13, 2026

Uh oh!

jimmygchen Jan 13, 2026

Uh oh!

eserilev Jan 13, 2026

Uh oh!

jimmygchen Jan 13, 2026

Uh oh!

eserilev Jan 12, 2026

Uh oh!

jimmygchen Jan 13, 2026

Uh oh!

eserilev left a comment •

edited

Loading

Uh oh!

jimmygchen commented Jan 14, 2026

Uh oh!

eserilev commented Jan 14, 2026

Uh oh!

mergify bot commented Jan 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement custom OpenTelemetry sampler to filter uninstrumented traces #8647

Implement custom OpenTelemetry sampler to filter uninstrumented traces #8647

Conversation

jimmygchen commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Additional Info

Uh oh!

eserilev left a comment

Choose a reason for hiding this comment

Uh oh!

eserilev Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

eserilev Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

eserilev Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

jimmygchen Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

eserilev left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jimmygchen commented Jan 14, 2026

Uh oh!

eserilev commented Jan 14, 2026

Uh oh!

mergify bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge Queue Status

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jimmygchen commented Jan 12, 2026 •

edited

Loading

eserilev left a comment •

edited

Loading

mergify bot commented Jan 22, 2026 •

edited

Loading