[connector/failover] Better handling logic for no healthy pipelines by singhvibhanshu · Pull Request #47211 · open-telemetry/opentelemetry-collector-contrib

singhvibhanshu · 2026-03-27T12:32:02Z

Resolves #46820

Description:
If every pipeline in the failover priority list reports an error, the connector stays stuck until the retry interval elapses and lets it re-attempt a higher-priority pipeline. The collector can drop data for a significant amount of time in the meantime, even if a higher-priority pipeline has already recovered.

This PR adds logic to consumeByHealthyPipeline for all three signals (Traces, Metrics, Logs). When the last-priority pipeline returns an error, the connector now makes one immediate pass over the unhealthy consumers and retries them. If a higher-priority consumer has recovered, the data is exported successfully and the stable index is reset, preventing unnecessary data loss.

Signed-off-by: singhvibhanshu <singhvibhanshu@hotmail.com>

jmacd · 2026-03-27T14:56:52Z

connector/failoverconnector/logs.go

 		}

 		if err := tc.ConsumeLogs(ctx, ld); err != nil {
+			if idx > 0 && idx == len(f.cfg.PipelinePriority)-1 {


What does the idx > 0 test do for us? I take it we want to try at least once even when all servers are unhealthy?

I added idx > 0 for the case where a user configures the failover connector with only a single pipeline. With one pipeline there is nothing else to retry, so the check prevents an unnecessary function call.

And yes, we want to try one last time even when all servers are unhealthy.
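To make the guard from the diff concrete, it can be isolated as a small predicate. This is only an illustration: `shouldRetryAll` and `numPriorities` are hypothetical names, with `numPriorities` standing in for `len(f.cfg.PipelinePriority)`.

```go
package main

// shouldRetryAll reports whether a failure at priority index idx should
// trigger the one-off retry pass: idx must be the last priority level, and
// there must be at least one higher-priority level to go back to.
func shouldRetryAll(idx, numPriorities int) bool {
	return idx > 0 && idx == numPriorities-1
}
```

With a single pipeline, idx 0 is also the last index, so without the idx > 0 check the retry pass would run with nothing to retry.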


singhvibhanshu · 2026-03-30T17:11:08Z

Hi there!
Kindly have a look at this.
Thanks!

better handling logic for no healthy pipelines

c06fc61

Signed-off-by: singhvibhanshu <singhvibhanshu@hotmail.com>

singhvibhanshu requested review from a team and fatsheep9146 as code owners March 27, 2026 12:32

github-actions bot assigned dashpole Mar 27, 2026

github-actions bot added the connector/failover label Mar 27, 2026

github-actions bot requested a review from akats7 March 27, 2026 12:32

jmacd approved these changes Mar 27, 2026


ci fixing via isolating tests

214e5c9

Signed-off-by: singhvibhanshu <singhvibhanshu@hotmail.com>

singhvibhanshu wants to merge 2 commits into open-telemetry:main from singhvibhanshu:fix/retryInterval

3 participants
