[DBM] Add container tags hash to queries (if enabled) by vandonr · Pull Request #8061 · DataDog/dd-trace-dotnet

vandonr · 2026-01-14T15:03:28Z

Summary of changes

Add the ability to write the container tags hash to DBM queries + to the related span.
The goal is that DBM would then query the spans bearing that hash, and then use the container tags on this (those) spans(s) to enrich the queries with it.
This is controlled by a setting that is disabled by default, and would be enabled if propagation mode is "service" or greater

see RFC: https://docs.google.com/document/d/15GtNOKGBCt6Dc-HsDNnMmCdZwhewFQx8yUlI9in5n3M
related PR in python: DataDog/dd-trace-py#15293

Reason for change

DBM and DSM propagate service context in outbound communications (SQL comments, message headers), but neither product has awareness of the container environment (e.g., kube_cluster, namespace, pod_name). Propagating full container tags is not feasible due to cardinality constraints (query cache invalidation in OracleDB/SQLServer, exponential pathway growth in DSM) and size limitations (64–128 bytes for DBM non-comment methods).

This is needed for the service renaming initiative (defining services based on container names) and APM primary tags (container-based dimensions like Kubernetes cluster).

The solution: the agent computes a hash of low-cardinality container tags and back-propagates it to the tracer, which includes it in outbound DBM/DSM communications. DBM then resolves the hash by correlating with APM spans that carry the same hash as a span tag.

Implementation details

Add BaseHash static class that computes an FNV-64 hash of ProcessTags.SerializedTags combined with the container tags hash from the agent, encoded as base64
Read the container tags hash from the Datadog Agent via DiscoveryService, stored in ContainerMetadata.ContainerTagsHash
ContainerMetadata converted from static to instance class (singleton via ContainerMetadata.Instance) to improve testability
DatabaseMonitoringPropagator injects the base hash into SQL comments (as ddch) when DD_DBM_INJECT_SQL_BASEHASH is true
Add _dd.dbm_container_tags_hash span tag on SqlTags so DBM can correlate the hash back to the span's container tags
New config key DD_DBM_INJECT_SQL_BASEHASH (disabled by default), intended to be enabled when DBM propagation mode is service or higher
Add container ID header to MinimalAgentHeaderHelper for agent communication

Test coverage

Adding a test in DbScopeFactoryTests.cs forced me to inject the value from pretty high, which I find a bit "dirty", but at least we don't have to rely on global static instance in tests.

Other details

## Summary of changes Replaced custom mutex guard with `std::lock_guard`, using `std::recursive_mutex` instead of `CRITICAL_SECTION` in windows and `std::mutex` with railings in Linux ## Reason for change Some locks have been spotted in smoke test wich could be cause by the lack of thread recursive lock in the `std::mutex` ## Implementation details ## Test coverage ## Other details

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4fd01fab6f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-14T15:16:10Z

tracer/src/Datadog.Trace/ClrProfiler/AutoInstrumentation/AdoNet/DbScopeFactory.cs

                    else
                    {
-                        // PropagateDataViaComment (service) - this injects varius trace information as a comment in the query
+                        if (tracer.Settings.InjectSqlBasehash && !string.IsNullOrEmpty(baseHash))
+                        {
+                            tags.BaseHash = baseHash;


Set BaseHash even when DBM comment already present

This new BaseHash tagging only happens in the else branch when the command text is not already DBM-injected. In the cached‑command scenario (or when users pre‑inject DBM comments), alreadyInjected is true, so _dd.propagated_hash is never set on subsequent spans even though the query still carries ddsh in the SQL comment. If DBM looks up container tags by scanning recent spans for that hash, later queries can’t be enriched once the first span ages out. Consider setting tags.BaseHash whenever the feature is enabled (and baseHash is non‑empty), regardless of the alreadyInjected branch.

Useful? React with 👍 / 👎.

hmm, yes, that's an interesting point, but I'm not sure we care, we only need one span with the hash to get the values, so we don't really need to tag all spans. I think in practice it works well like this.

Consider adding a comment explaining this.

pr-commenter · 2026-01-14T16:02:18Z

Benchmarks

Benchmark execution time: 2026-03-24 11:01:35

Comparing candidate commit 900ac90 in PR branch vandonr/process2 with baseline commit 4e38cdd in branch master.

Found 9 performance improvements and 7 performance regressions! Performance is the same for 258 metrics, 14 unstable metrics.

Explanation

This is an A/B test comparing a candidate commit's performance against that of a baseline commit. Performance changes are noted in the tables below as:

🟩 = significantly better candidate vs. baseline
🟥 = significantly worse candidate vs. baseline

We compute a confidence interval (CI) over the relative difference of means between metrics from the candidate and baseline commits, considering the baseline as the reference.

If the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD), the change is considered significant.

Feel free to reach out to #apm-benchmarking-platform on Slack if you have any questions.

More details about the CI and significant changes

You can imagine this CI as a range of values that is likely to contain the true difference of means between the candidate and baseline commits.

CIs of the difference of means are often centered around 0%, because often changes are not that big:

---------------------------------(------|---^--------)-------------------------------->
                              -0.6%    0%  0.3%     +1.2%
                                 |          |        |
         lower bound of the CI --'          |        |
sample mean (center of the CI) -------------'        |
         upper bound of the CI ----------------------'

As described above, a change is considered significant if the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD).

For instance, for an execution time metric, this confidence interval indicates a significantly worse performance:

----------------------------------------|---------|---(---------^---------)---------->
                                       0%        1%  1.3%      2.2%      3.1%
                                                  |   |         |         |
       significant impact threshold --------------'   |         |         |
                      lower bound of CI --------------'         |         |
       sample mean (center of the CI) --------------------------'         |
                      upper bound of CI ----------------------------------'

scenario:Benchmarks.Trace.AgentWriterBenchmark.WriteAndFlushEnrichedTraces netcoreapp3.1

🟩 execution_time [-87.790ms; -87.704ms] or [-44.085%; -44.042%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorMoreComplexBody net6.0

🟥 execution_time [+15.198ms; +19.106ms] or [+7.780%; +9.780%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorSimpleBody net6.0

🟩 execution_time [-16.899ms; -13.287ms] or [-7.904%; -6.215%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorSimpleBody netcoreapp3.1

🟥 execution_time [+16.264ms; +22.445ms] or [+8.284%; +11.432%]

scenario:Benchmarks.Trace.AspNetCoreBenchmark.SendRequest net6.0

🟩 execution_time [-8.803ms; -7.370ms] or [-8.834%; -7.396%]

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces net472

🟩 execution_time [-16.356ms; -13.298ms] or [-7.013%; -5.702%]
🟩 throughput [+62.858op/s; +76.693op/s] or [+6.108%; +7.452%]

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces netcoreapp3.1

🟥 execution_time [+56.258ms; +63.712ms] or [+27.570%; +31.222%]
🟥 throughput [-394.236op/s; -347.884op/s] or [-23.931%; -21.118%]

scenario:Benchmarks.Trace.CharSliceBenchmark.OptimizedCharSlice netcoreapp3.1

🟥 execution_time [+147.055µs; +157.625µs] or [+5.337%; +5.720%]
🟥 throughput [-19.705op/s; -18.331op/s] or [-5.429%; -5.051%]

scenario:Benchmarks.Trace.ElasticsearchBenchmark.CallElasticsearchAsync net472

🟥 throughput [-18999.263op/s; -16744.614op/s] or [-6.104%; -5.380%]

scenario:Benchmarks.Trace.Iast.StringAspectsBenchmark.StringConcatAspectBenchmark netcoreapp3.1

🟩 allocated_mem [-16.498KB; -16.469KB] or [-6.045%; -6.034%]

scenario:Benchmarks.Trace.Log4netBenchmark.EnrichedLog netcoreapp3.1

🟩 execution_time [-38.452ms; -34.542ms] or [-18.879%; -16.959%]

scenario:Benchmarks.Trace.SingleSpanAspNetCoreBenchmark.SingleSpanAspNetCore netcoreapp3.1

🟩 throughput [+14923514.547op/s; +16273400.150op/s] or [+6.617%; +7.215%]

scenario:Benchmarks.Trace.TraceAnnotationsBenchmark.RunOnMethodBegin net472

🟩 throughput [+42708.064op/s; +44809.510op/s] or [+6.289%; +6.599%]

vandonr · 2026-01-15T14:53:13Z

I just realized I need to put process tags in there too

bouwkast

I think the main question that I have is that it appears this is correctly following the RFC in how we propagate the hash, but the merged Python implementation recomputes the hash.

But the RFC isn't precise enough in describing the hash and expected behavior / requirements for me to know which is correct really

tracer/src/Datadog.Trace/DatabaseMonitoring/DatabaseMonitoringPropagator.cs

tracer/src/Datadog.Trace/Tagging/SqlTags.cs

…to tracermanager and discoveryservice

vandonr · 2026-03-24T10:22:12Z

tracer/src/Datadog.Trace/Ci/TestOptimization.cs

            getDiscoveryServiceFunc: static s => DiscoveryService.CreateUnmanaged(
                s.TracerSettings.Manager.InitialExporterSettings,
                ContainerMetadata.Instance,
+                new ServiceRemappingHash(null),


this one I'm not 100% sure, but since it's only used for DBM for now, I don't think it'd play any role in that code path, so it should be safe to hardcode a disabled instance

lucaspimentel · 2026-03-24T18:30:37Z

tracer/src/Datadog.Trace/DatabaseMonitoring/DatabaseMonitoringPropagator.cs

        private const string SqlCommentOuthost = "ddh";
        private const string SqlCommentVersion = "ddpv";
        private const string SqlCommentEnv = "dde";
+        private const string SqlCommentBaseHash = "ddsh";


The PR description says

injects the base hash into SQL comments (as ddch)

I couldn't find either one in the RFC, but the dd-trace-py PR uses ddsh, like this one. Is that a typo in the PR description?

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

lucaspimentel · 2026-03-24T19:01:59Z

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

+    public string? ContainerTagsHash
+    {
+        get;
+        private set;
+    }
+
+    /// <summary>
+    /// Gets the base64 representation of the hash
+    /// </summary>
+    public string? B64Value
+    {
+        get;
+        private set;
+    }


These properties used to have Volatile.Read()/Volatile.Write() and we should probably keep that since they are written from a background thread in DiscoveryService and read in the hot path when creating spans.

Furthermore, UpdateContainerTagsHash updates both values non-atomically, so a reader could see a stale B64Value with a new ContainerTagsHash. If consistency between the two is important, consider using a lock to read/write both values, or using immutable copies.

lucaspimentel · 2026-03-24T19:07:25Z

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

+            hash = FnvHash64.GenerateHash(containerTagsHash, FnvHash64.Version.V1, hash);
+        }
+
+        var b64 = Convert.ToBase64String(BitConverter.GetBytes(hash));


This code is allocating:

byte[] in BitConverter.GetBytes()
char[] for the parameter in TrimEnd(params char[]) in .NET Framework (Newer runtimes have a TrimEnd(char) overload)
string in TrimEnd() if it modifies the string
more string instance for each Replace() if they modify the string

Good news! We have "vendored" versions of BinaryPrimitives and Base64, so we can avoid BitConverter.GetBytes() and Convert.ToBase64String(), and then trimming and replacing 1:1 chars can be done in place, so this code should work in all TFMs:

#if NETCOREAPP3_1_OR_GREATER Span<byte> buf = stackalloc byte[12]; #else // can't stackalloc into the vendored Span<T> var buf = new byte[12]; #endif BinaryPrimitives.WriteUInt64LittleEndian(buf, hash); // write 8 bytes into a 12-byte buffer Base64.EncodeToUtf8InPlace(buf, 8, out int bytesWritten); while (bytesWritten > 0 && buf[bytesWritten - 1] == (byte)'=') { bytesWritten--; } for (int i = 0; i < bytesWritten; i++) { if (buf[i] == (byte)'+') { buf[i] = (byte)'-'; } else if (buf[i] == (byte)'/') { buf[i] = (byte)'_'; } } #if NETCOREAPP3_1_OR_GREATER return Encoding.ASCII.GetString(buf[..bytesWritten]); #else // can't use Range return Encoding.ASCII.GetString(buf, 0, bytesWritten); #endif

This has zero heap allocations on NETCOREAPP3_1_OR_GREATER, and only the byte[12] otherwise (aside from the final string we need to return in both cases which is unavoidable).

lucaspimentel · 2026-03-24T19:10:12Z

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

+using System.Threading;
+using Datadog.Trace.PlatformHelpers;


Not used.

Suggested change

using System.Threading;

using Datadog.Trace.PlatformHelpers;

lucaspimentel · 2026-03-24T19:14:57Z

tracer/src/Datadog.Trace/ClrProfiler/AutoInstrumentation/AdoNet/DbScopeFactory.cs

            public static Scope? CreateDbCommandScope(Tracer tracer, IDbCommand command)
            {
                var commandType = command.GetType();
+                var baseHash = tracer.TracerManager.ServiceRemappingHash?.B64Value;


Should we guard this behind the setting?

var baseHash = tracer.Settings.DbmInjectSqlBasehash ? tracer.TracerManager.ServiceRemappingHash?.B64Value : null;

lucaspimentel · 2026-03-24T19:17:06Z

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

+        }
+    }
+
+    private static string Compute(string processTags, string? containerTagsHash)


While working on the "less-allocatey" code below, I noticed there are no unit tests for this method.

lucaspimentel · 2026-03-24T19:30:17Z

tracer/src/Datadog.Trace/PlatformHelpers/ContainerMetadata.NetFramework.cs

-            if (!_warnedOnSet)
-            {
-                _warnedOnSet = true;
-                Log.Error("The code is trying to set the value '{Value}' to {Prop}, but this has no effect in .NET Framework.", value, nameof(ContainerTagsHash));


This log is now gone in the new version. Intentional?

lucaspimentel · 2026-03-24T19:48:12Z

tracer/src/Datadog.Trace/ServiceRemappingHash.cs

+    /// <summary>
+    /// Gets the base64 representation of the hash
+    /// </summary>
+    public string? B64Value


[Naming nit] The .NET naming conventions would use Base64Value, here, or simply Base64. No need to abbreviate "Base" to "B".

lucaspimentel · 2026-03-24T20:50:31Z

related: #8363

Instead of guarding the caller with #if !NETFRAMEWORK, make the setter a silent no-op. This avoids conflict with #8061 which replaces the caller entirely.

vandonr and others added 10 commits December 2, 2025 18:03

read container tags hash from agent

9a80cbf

make ContainerMetadata an instance class

d565d21

use constant in tests

1ad4b02

adapt code to instance container metadata

37a3ac1

Merge remote-tracking branch 'origin/master' into vandonr/process3

c93c3a9

use local instance

66630ec

nit

dcb297a

fix integration tests

653a3a5

add container tags hash to DBM queries (if enabled)

4fd01fa

vandonr requested review from a team as code owners January 14, 2026 15:03

chatgpt-codex-connector bot reviewed Jan 14, 2026

View reviewed changes

vandonr marked this pull request as draft January 15, 2026 14:53

bouwkast reviewed Jan 15, 2026

View reviewed changes

tracer/src/Datadog.Trace/DatabaseMonitoring/DatabaseMonitoringPropagator.cs Show resolved Hide resolved

bouwkast reviewed Jan 15, 2026

View reviewed changes

tracer/src/Datadog.Trace/Tagging/SqlTags.cs Show resolved Hide resolved

vandonr mentioned this pull request Jan 19, 2026

Read container tags hash from agent #7893

Merged

vandonr added 8 commits January 19, 2026 14:48

use volatile read/write

d9abd75

use collection expression

a5f3f8d

add container ID header to MinimalAgentHeaderHelper

c18fc65

Merge remote-tracking branch 'origin/master' into vandonr/process3

4ca4fdf

Merge branch 'vandonr/process3' into vandonr/process2

fdaf436

add a class for basehash

043c8b6

use base hash

beb1980

rename configuration key (I missed the prefix)

a75fa60

Base automatically changed from vandonr/process3 to master February 3, 2026 18:49

Merge remote-tracking branch 'origin/master' into vandonr/process2

18f4461

vandonr added 2 commits March 16, 2026 10:40

nits

2f0eeb2

Merge remote-tracking branch 'origin/master' into vandonr/process2

1d2679c

vandonr changed the title ~~Add container tags hash to DBM queries (if enabled)~~ [DBM] Add container tags hash to queries (if enabled) Mar 17, 2026

vandonr added 3 commits March 17, 2026 17:02

replace static init with getter init

3d04aad

rename basehash for clarity

6af872d

apply the same changes to container metadata as in the DSM PR

d81eb71

vandonr force-pushed the vandonr/process2 branch from eb11b9e to d81eb71 Compare March 18, 2026 08:56

vandonr added 3 commits March 18, 2026 11:14

exclude test that is specific to netcore

335479d

Merge remote-tracking branch 'origin/master' into vandonr/process2

747be87

move tags hash to serviceremapping hash, make it non-static, feed it …

a89cb3f

…to tracermanager and discoveryservice

vandonr requested a review from a team as a code owner March 24, 2026 10:09

vandonr added 2 commits March 24, 2026 11:14

reduce number of modified files

d3c5d7f

undo some autoformating

900ac90

vandonr commented Mar 24, 2026

View reviewed changes

vandonr added 2 commits March 24, 2026 16:01

use url-safe b64

f1c304e

allow SRH injection into tracerManagerFactory

91dc0c7

vandonr force-pushed the vandonr/process2 branch from 2bf7dde to 91dc0c7 Compare March 24, 2026 15:05

fix itest build

44c94a7

lucaspimentel reviewed Mar 24, 2026

View reviewed changes

tracer/src/Datadog.Trace/ServiceRemappingHash.cs Show resolved Hide resolved

lucaspimentel reviewed Mar 24, 2026

View reviewed changes

lucaspimentel mentioned this pull request Mar 24, 2026

Wrap ContainerTagsHash around !NETFRAMEWORK #8363

Open

bouwkast added a commit that referenced this pull request Mar 24, 2026

Remove ContainerTagsHash Log.Error on .NET Framework

2893a40

Instead of guarding the caller with #if !NETFRAMEWORK, make the setter a silent no-op. This avoids conflict with #8061 which replaces the caller entirely.

Conversation

vandonr commented Jan 14, 2026 • edited by lucaspimentel Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary of changes

Reason for change

Implementation details

Test coverage

Other details

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pr-commenter bot commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

Explanation

More details about the CI and significant changes

scenario:Benchmarks.Trace.AgentWriterBenchmark.WriteAndFlushEnrichedTraces netcoreapp3.1

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorMoreComplexBody net6.0

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorSimpleBody net6.0

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.ObjectExtractorSimpleBody netcoreapp3.1

scenario:Benchmarks.Trace.AspNetCoreBenchmark.SendRequest net6.0

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces net472

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces netcoreapp3.1

scenario:Benchmarks.Trace.CharSliceBenchmark.OptimizedCharSlice netcoreapp3.1

scenario:Benchmarks.Trace.ElasticsearchBenchmark.CallElasticsearchAsync net472

scenario:Benchmarks.Trace.Iast.StringAspectsBenchmark.StringConcatAspectBenchmark netcoreapp3.1

scenario:Benchmarks.Trace.Log4netBenchmark.EnrichedLog netcoreapp3.1

scenario:Benchmarks.Trace.SingleSpanAspNetCoreBenchmark.SingleSpanAspNetCore netcoreapp3.1

scenario:Benchmarks.Trace.TraceAnnotationsBenchmark.RunOnMethodBegin net472

Uh oh!

vandonr commented Jan 15, 2026

Uh oh!

bouwkast left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lucaspimentel Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lucaspimentel Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lucaspimentel Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lucaspimentel commented Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

vandonr commented Jan 14, 2026 •

edited by lucaspimentel

Loading

pr-commenter bot commented Jan 14, 2026 •

edited

Loading

lucaspimentel Mar 24, 2026 •

edited

Loading

lucaspimentel Mar 24, 2026 •

edited

Loading

lucaspimentel Mar 24, 2026 •

edited

Loading