Add tracing spans for input and output gates for hold and wait #5666

shrima-cf · 2025-12-09T19:50:57Z

These spans will be useful during latency investigations for Durable Objects

Previous context - #5631

justin-mp

I think this is trending in the right direction.

Where do we capture the spans for SQLite operations?

justin-mp · 2025-12-09T23:14:39Z

src/workerd/io/io-gate.h

    // before the gate is unlocked.
    Lock addRef() {
-      return Lock(*gate);
+      return Lock(*gate, nullptr);


Do we know how often this is called? It would be nice to preserve the span if we can.

justin-mp · 2025-12-10T13:59:53Z

src/workerd/api/actor-state.c++

  auto& context = IoContext::current();
  auto userSpan = context.makeUserTraceSpan("durable_object_storage_sync"_kjc);
-  KJ_IF_SOME(p, cache->onNoPendingFlush()) {
+  KJ_IF_SOME(p, cache->onNoPendingFlush(context.getCurrentTraceSpan())) {


Why is this better to do than context.makeTraceSpan like we do in DurableObjectStorage::getCurrentBookmark()?

justin-mp · 2025-12-10T15:53:13Z

src/workerd/io/actor-sqlite.h

  void shutdown(kj::Maybe<const kj::Exception&> maybeException) override;
  kj::OneOf<CancelAlarmHandler, RunAlarmHandler> armAlarmHandler(
-      kj::Date scheduledTime, bool noCache = false, kj::StringPtr actorId = "") override;
+      kj::Date scheduledTime, bool noCache, kj::StringPtr actorId, SpanParent parentSpan) override;


You can keep the optional parameters by putting the parentSpan before them. Unless we always provide the optional parameters, it's probably easier to keep the previous interface.

justin-mp · 2025-12-10T15:55:37Z

src/workerd/io/actor-sqlite.h


+  // Trace span for the current commit operation. Captured from the first write
+  // that triggers a commit, used for the output gate lock hold trace.
+  SpanParent currentCommitSpan = nullptr;


What if the first write is allowUnconfirmed? At that point, we don't actually lock the output gate.

justin-mp · 2025-12-10T16:03:26Z

src/workerd/io/actor-sqlite.c++

+    // Reset the commit span after the commit completes
+    auto resetSpan = kj::defer([this]() {
+      currentCommitSpan = nullptr;
+      hasCommitSpan = false;
+    });


New commits can start before the previous commit has finished. The right thing to do is to kj::mv the currentCommitSpan into the commitImpl and reset it it immediately.

justin-mp · 2025-12-10T16:06:43Z

src/workerd/io/actor-sqlite.c++

+      // Capture trace span from the alarm handler for the commit batch.
+      if (!hasCommitSpan) {
+        currentCommitSpan = kj::mv(deferredAlarmSpan);
+        hasCommitSpan = true;
+      }


I don't quite understand why we're capturing the span here?

justin-mp · 2025-12-10T16:22:34Z

src/workerd/io/actor-sqlite.c++

+  // Capture trace span from the first write in this commit batch.
+  if (!hasCommitSpan) {
+    currentCommitSpan = kj::mv(traceSpan);
+    hasCommitSpan = true;
+  }


I think a better pattern would be to capture the currentCommitSpan on every write and then use whatever is the current capture when you actually do lockWhile. That way if you have a allowUnconfirmed write, you won't capture that one, which won't actually wait for the output gate. (As an optimization, you could stop overwriting the currentCommitSpan once you send it to the lockWhile.)

Also, this really should be a method that we call rather than duplicating it in every method.

justin-mp · 2025-12-10T16:27:20Z

src/workerd/io/actor-sqlite.h

+  // Trace span for the deferred alarm deletion, captured from armAlarmHandler and used when
+  // the alarm is actually deleted.
+  SpanParent deferredAlarmSpan = nullptr;


Why do we need this one separate from the currentCommitSpan?

justin-mp · 2025-12-10T16:35:55Z

src/workerd/io/actor-cache.c++

+  // Capture the first span for use at commit time
+  if (!hasCommitSpan) {
+    commitSpan = kj::mv(traceSpan);
+    hasCommitSpan = true;
+  }


See my comment in the sqlite file. I think the same thing applies here, at least when it comes to upgrading the output gate.

Add tracing spans for input gate hold and wait

7d326c9

shrima-cf requested review from a-robinson and justin-mp December 9, 2025 19:50

shrima-cf requested review from a team as code owners December 9, 2025 19:50

shrima-cf added 2 commits December 9, 2025 12:41

Pass Spans to InputGate wait()

499050b

Add tracing spans for output gate wait

63fe0ba

shrima-cf force-pushed the shrima/STOR-3398-3 branch 2 times, most recently from e7436b0 to 71c15e3 Compare December 9, 2025 21:18

shrima-cf added 3 commits December 9, 2025 13:31

Pass Spans to OutputGate wait()

629bf90

Add tracing spans for output gate hold

3909af1

Pass Spans to OutputGate lockWhile

79c3cb4

shrima-cf force-pushed the shrima/STOR-3398-3 branch from 71c15e3 to 79c3cb4 Compare December 9, 2025 21:53

This comment was marked as outdated.

Sign in to view

justin-mp reviewed Dec 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add tracing spans for input and output gates for hold and wait #5666

Add tracing spans for input and output gates for hold and wait #5666

Uh oh!

shrima-cf commented Dec 9, 2025

Uh oh!

This comment was marked as outdated.

justin-mp left a comment

Uh oh!

justin-mp Dec 9, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

justin-mp Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add tracing spans for input and output gates for hold and wait #5666

Are you sure you want to change the base?

Add tracing spans for input and output gates for hold and wait #5666

Uh oh!

Conversation

shrima-cf commented Dec 9, 2025

Uh oh!

This comment was marked as outdated.

justin-mp left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants