Skip to content

Bug Report: vttablet with high replication lag start as healthy instead of unhealthy #17922

Open
@L3o-pold

Description

@L3o-pold

Overview of the Issue

vttablet was shutdown for maintenance. After 11 hours, we started it again.

With a 11 hours replication lag, it started as healthy then some seconds after that switched to unhealthy (due to replication lag).

We think that vttablet should be started as unhealthy to avoid sending query to this replica.

Image

Reproduction Steps

start a vttablet with a lag greater than the discovery_high_replication_lag_minimum_serving vtgate argument.

Binary Version

mysqlctld version Version: 21.0.3 (Git revision 94fdc736eae8928a8fdde44e9ec9c3bee1868d6f branch 'HEAD') built on Wed Feb 12 16:52:22 UTC 2025 by vitess@buildkitsandbox using go1.23.5 linux/amd64

Operating System and Environment details

docker

Log Fragments

vttablet I0306 08:59:14.831788       1 state_manager.go:740] Replication is healthy

vttablet I0306 08:59:19.833346       1 state_manager.go:735] Going unhealthy due to high replication lag: 10h51m48s

vttablet I0306 09:00:39.848743       1 state_manager.go:740] Replication is healthy

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions