Improve "grace" period for "check". #23661

Open
@EugenKon

Description

Proposal

grace 30s vs. 5m (in case of a database upgrade).

My interpretation of the documentation is that "grace" instructs Nomad not to check the health of a service at all during this period. The logic can be summarized as follows:

Start service
Wait "grace" period
Start monitoring health check

I expected a different "grace" strategy to be implemented: discard failed health checks while in the grace period. The logic can be summarized as follows:

Start service
While in "grace" period:
    Run health check logic
    If OK, break
Start monitoring health check

The difference is that in the first case Nomad always waits the full 5 minutes, even if the service is ready and healthy 30 seconds after it starts. In the second case, Nomad marks the service as healthy as soon as it passes the health check -- 30 seconds after its start.
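
The two strategies can be compared with a small simulation. This is only a toy model of the logic described above, not Nomad's implementation: the function name `simulate`, its parameters, the 10-second check interval, and the one-hour horizon are all assumptions made for illustration.

```python
from typing import Optional, Tuple


def simulate(
    ready_at: int,
    grace: int,
    interval: int,
    early_exit: bool,
    horizon: int = 3600,
) -> Tuple[Optional[int], int]:
    """Toy model of a health check with a grace period (all times in seconds).

    ready_at   -- hypothetical moment the service starts passing its check
    grace      -- length of the grace period
    interval   -- time between two health checks
    early_exit -- False: no checks run during grace (first strategy)
                  True: checks run during grace, failures are discarded
                  (second strategy)

    Returns (time of the first passing check, failures counted after grace).
    The time is None if the service never passes within the horizon.
    """
    counted_failures = 0
    # The first strategy runs its first check only when grace ends;
    # the second strategy starts checking immediately.
    t = 0 if early_exit else grace
    while t <= horizon:
        if t >= ready_at:
            return t, counted_failures
        if t >= grace:
            # Only failures outside the grace period count against the service.
            counted_failures += 1
        t += interval
    return None, counted_failures


if __name__ == "__main__":
    # Service is ready after 30s, grace is 5m, checks run every 10s.
    print(simulate(ready_at=30, grace=300, interval=10, early_exit=False))  # (300, 0)
    print(simulate(ready_at=30, grace=300, interval=10, early_exit=True))   # (30, 0)
```

In this model both strategies behave identically for a service that needs longer than the grace period; they differ only when the service becomes healthy early.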

Labels

stage/accepted: Confirmed, and intend to work on. No timeline commitment though.
theme/docs: Documentation issues and enhancements
type/enhancement
