Skip to content

Removed node cannot rejoin properly #140

@mengrj

Description

@mengrj

While using Jepsen to test raft-0.17.1 and dqlite-0.14.0, we frequently observe the following issue. The removed node cannot rejoin properly.

2023-04-17 07:15:12 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:15:12.708606 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:15:12.708674 node was removed
2023-04-17 07:15:45 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:15:45.284433 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:15:45.284495 node was removed
2023-04-17 07:15:47 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:15:47.698687 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:15:47.698761 node was removed
2023-04-17 07:16:17 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:16:17.750930 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:16:17.750983 node was removed
2023-04-17 07:16:35 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:16:35.314053 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:16:35.314106 node was removed
2023-04-17 07:16:37 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:16:37.821749 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"
2023/04/17 07:16:37.821814 node was removed
2023-04-17 07:16:50 Jepsen starting LIBDQLITE_TRACE=1 LIBRAFT_TRACE=1 /opt/fs/dqlite/app -dir /opt/fs/dqlite/data -node n1 -latency 10 -cluster n1,n2,n3,n4,n5
2023/04/17 07:16:50.335459 starting "n1" with IP "10.1.1.3" and cluster "n1,n2,n3,n4,n5"

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions