jglick (Member) commented Oct 2, 2025

Compare #675. This is called from innerRun which already has its own retry logic.

An integration test in CloudBees CI uses a TCP agent in a Pipeline build; the test restarts the controller and picks a different host:port after the restart. The shutdown of the original controller naturally causes the agent to disconnect. When the agent tries to connect again, depending on timing, JnlpAgentEndpointResolver.resolve might initially have obtained the old host:port, and connectTcp will then try repeatedly to contact it, failing each time. Eventually it gives up and falls back into the main retry loop in innerRun, but by that time the resumed build might already have stopped waiting for the agent to reconnect (after a 5-minute timeout) and aborted. At least I do see the 5-minute timeout, though I do not see any messages from innerRun after connectTcp gives up, so I am not exactly sure what the agent is doing at that point.
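
To illustrate the timing problem described above, here is a minimal sketch of an outer retry loop that resolves the endpoint again before every single connection attempt, so that a stale host:port is dropped after one failure instead of being retried repeatedly. It is not the Remoting code: the class `ReconnectSketch`, the method `connectWithFreshResolution`, and the simulated resolver and connector are all hypothetical names invented for this example, and endpoints are modeled as plain `host:port` strings.

```java
import java.util.function.Predicate;
import java.util.function.Supplier;

public class ReconnectSketch {

    /**
     * Hypothetical outer retry loop. Each iteration asks the resolver for the
     * current endpoint and makes exactly one connection attempt against it.
     * Returns the endpoint that accepted the connection, or null if every
     * attempt failed.
     */
    static String connectWithFreshResolution(
            Supplier<String> resolver, Predicate<String> tryConnect, int maxAttempts) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            String endpoint = resolver.get(); // re-resolve on every attempt
            if (tryConnect.test(endpoint)) {
                return endpoint;
            }
            // A real agent would wait or back off here before the next attempt.
        }
        return null;
    }

    public static void main(String[] args) {
        String staleEndpoint = "old-controller:50000"; // made-up values
        String freshEndpoint = "new-controller:50001";

        // Simulated resolver: the first lookup still returns the stale endpoint,
        // later lookups return the endpoint chosen after the restart.
        int[] lookups = {0};
        Supplier<String> resolver = () -> lookups[0]++ == 0 ? staleEndpoint : freshEndpoint;

        // Simulated connection attempt: only the fresh endpoint accepts.
        Predicate<String> tryConnect = freshEndpoint::equals;

        String connected = connectWithFreshResolution(resolver, tryConnect, 5);
        System.out.println("connected to " + connected + " after " + lookups[0] + " lookups");
    }
}
```

In this simulation the stale endpoint costs one failed attempt before the loop picks up the new one. An inner loop that kept retrying the first resolved endpoint would spend its whole retry budget on an address that can no longer answer.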

#771 (comment) remains pending.

jglick added the bug (For changelog: Fixes a bug.) label Oct 2, 2025
jglick requested a review from a team October 2, 2025 19:49
jglick marked this pull request as ready for review October 2, 2025 19:49
jglick requested a review from Vlatombe October 2, 2025 19:49
Vlatombe (Member) commented Oct 3, 2025

> #771 (comment) remains pending.

#773

jglick merged commit 03dee9b into jenkinsci:master Oct 3, 2025
14 checks passed
jglick deleted the connectTcp branch October 3, 2025 17:26