Skip to content

Investigate why MemberReplace failpoint flakes on release-3.4 #18929

Open
@serathius

Description

@serathius

Bug report criteria

What happened?

In last robustness meeting we identified 3 flakes for memberReplace

All happening on release-3.4 and TestRobustnessExploratory/KubernetesHighTraffic/ClusterOfSize3/MemberReplace test

What did you expect to happen?

Issue not being specific to release-3.4

How can we reproduce it (as minimally and precisely as possible)?

There is no way to select failpoints via test name, but you can modify allFailpoints in test/robustness/failpoint/failpoint.go to leave only MemberReplace

And run it with GO_TEST_FLAGS='-v --run TestRobustnessExploratory/KubernetesHighTraffic/ClusterOfSize3 --count 100 --failfast --timeout 1h' make test-robustness-release-3.4

Anything else we need to know?

No response

Etcd version (please run commands below)

release-3.4 branch

Etcd configuration (command line flags or environment variables)

paste your configuration here

Etcd debug information (please run commands below, feel free to obfuscate the IP address or FQDN in the output)

$ etcdctl member list -w table
# paste output here

$ etcdctl --endpoints=<member list> endpoint status -w table
# paste output here

Relevant log output

No response

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions