Skip to content

Question: supervisor tree fault tolerance #2684

@photoszzt

Description

@photoszzt

Hi Monarch developer,

I don't find this anywhere in the document. What happen when the process of the root of the supervisor tree fails due to either OOM or machine failure of the running node? Would everything below the root just die? Is there a leader election process to establish the new leader to be the new supervisor? I try to search the codebase and I don't see any raft or paxos references.

Metadata

Metadata

Assignees

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions