Skip to content

Report more detail of successful/failed channels in ChannelsConnectedListener#onTimeout #110907

Open
@DaveCTurner

Description

@DaveCTurner

A Transport$Connection comprises several TCP channels. When establishing such a connection we first open all the TCP channels. If this process takes longer than the timeout (default 30s) then we tear down any successful channels and report a failure. Today the failure message is just "connect_timeout[" + connectionProfile.getConnectTimeout() + "]" indicating the timeout that elapsed, but it would also be useful to know whether we failed to connect any channels within the time, or whether perhaps it was just one that failed: if some succeed then that suggests an application issue whereas if none succeed then it's more likely infrastructural.

Let's extend this message to report the number of channels which didn't connect within the timeout, and maybe even describe the channels which did connect successfully (if any).

Metadata

Metadata

Assignees

No one assigned

    Labels

    :Distributed Coordination/NetworkHttp and internode communication implementations>enhancementSupportabilityImprove our (devs, SREs, support eng, users) ability to troubleshoot/self-service product better.Team:Distributed (Obsolete)Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions