Add semaphore to limit concurrent API calls during Kubernetes observer startup by tomerqodo · Pull Request #11 · agentic-review-benchmarks/prefect-combined-sentry

tomerqodo · 2026-02-02T19:45:16Z

Benchmark PR from agentic-review-benchmarks#11

…r startup

closes PrefectHQ#19937

When the observer starts, it receives events for all existing pods/jobs and checks for duplicates via API calls. This can overwhelm the server when there are many pods. This change adds a configurable semaphore (default: 5 concurrent calls) to rate-limit these startup checks.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

sentry · 2026-02-02T19:48:01Z

+                    )
+                ),
+            )

        response = await orchestration_client.request(
            "POST",
            "/events/filter",
-            json=dict(filter=event_filter.model_dump(exclude_unset=True, mode="json")),
+            json=dict(
+                filter=event_filter.model_dump(exclude_unset=True, mode="json")
+            ),
        )


Bug: The `orchestration_client.request()` call is outside the `async with _startup_event_semaphore:` block, so the API requests are not rate-limited as intended.
Severity: CRITICAL

Suggested Fix

Move the `response = await orchestration_client.request(...)` call inside the `async with _startup_event_semaphore:` block, so that the network requests themselves are limited by the semaphore as intended.
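The difference between the two placements can be checked with a small, self-contained sketch. Everything here is hypothetical and only stands in for the real code: `FakeClient`, `check_unguarded`, `check_guarded`, and `peak_concurrency` are illustrative names, the payload is a placeholder rather than the real event filter, and the limit of 5 is taken from the default mentioned in the PR description. The stub client records how many requests are in flight at once.

```python
import asyncio


class FakeClient:
    """Hypothetical stub client that records the peak number of in-flight requests."""

    def __init__(self) -> None:
        self.in_flight = 0
        self.peak = 0

    async def request(self, method: str, path: str, json: dict) -> dict:
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        await asyncio.sleep(0.01)  # simulate network latency
        self.in_flight -= 1
        return {"events": []}


async def check_unguarded(client: FakeClient, pod: str, sem: asyncio.Semaphore) -> dict:
    # Pattern flagged in the review: the semaphore is released before the request runs.
    async with sem:
        payload = {"filter": {"resource": pod}}  # placeholder for the event filter
    return await client.request("POST", "/events/filter", json=payload)


async def check_guarded(client: FakeClient, pod: str, sem: asyncio.Semaphore) -> dict:
    # Suggested pattern: the request is awaited while the semaphore is still held.
    async with sem:
        payload = {"filter": {"resource": pod}}  # placeholder for the event filter
        return await client.request("POST", "/events/filter", json=payload)


async def peak_concurrency(check, n_pods: int = 20, limit: int = 5) -> int:
    """Run `check` for n_pods simulated pods and return the peak concurrent requests."""
    sem = asyncio.Semaphore(limit)
    client = FakeClient()
    await asyncio.gather(*(check(client, f"pod-{i}", sem) for i in range(n_pods)))
    return client.peak


if __name__ == "__main__":
    print("unguarded peak:", asyncio.run(peak_concurrency(check_unguarded)))
    print("guarded peak:", asyncio.run(peak_concurrency(check_guarded)))
```

In this sketch the unguarded variant lets all 20 simulated requests run at once, while the guarded variant never exceeds the limit of 5. A test along these lines against the real observer function would need a mocked orchestration client; that part is not shown here.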

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid.

Location: src/integrations/prefect-kubernetes/prefect_kubernetes/observer.py#L137-L159

Potential issue: The code introduces a semaphore, `_startup_event_semaphore`, to rate-limit API calls during observer startup. However, the `async with` block for the semaphore only wraps the creation of the `event_filter` object. The actual network request, `await orchestration_client.request(...)`, is executed after the semaphore context has been exited. As a result, the expensive API calls to the `/events/filter` endpoint are not rate-limited, defeating the purpose of the change. This can lead to the observer overwhelming the API server with concurrent requests upon startup when many pods are present.


desertaxle and others added 2 commits January 25, 2026 12:10

update pr

5297067

sentry Bot reviewed Feb 2, 2026


2 participants
