
On demand repairs with ERROR status #1307

@cezarpaulo16

Description:

1- Does the problem persist?
Yes

2- Describe how to reproduce the issue:

  1. Deploy a cluster with 2 datacenters, each with 3 Cassandra replicas, and 2 ecChronos instances, one per datacenter.
  2. Create two tables, tb1 and tb2, in the same keyspace.
  3. Restart the ecChronos pod in datacenter two.
  4. Trigger an on-demand repair for the keyspace in both datacenters: "ecctool run-repairs -k ks --all".
  5. Check the repair status; some repairs get an ERROR status in datacenter two.
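
For step 5, a small helper like the one below could be used to count how many repairs ended in ERROR once the status output has been saved to a file or captured as text. This is only a sketch: the function name `count_rows_with_status` is hypothetical, and the row layout in `sample` (pipe-separated columns) is an assumption, not the confirmed output format of the status command.

```python
import re


def count_rows_with_status(lines, status="ERROR"):
    """Count lines that contain `status` as a standalone token.

    Lines are split on whitespace and '|' so that a word such as
    "ERRORS" or "NO_ERROR" is not counted as a match.
    """
    count = 0
    for line in lines:
        tokens = [t for t in re.split(r"[\s|]+", line.strip()) if t]
        if status in tokens:
            count += 1
    return count


# Hypothetical sample rows; the real status output may look different.
sample = [
    "| ks | tb1 | COMPLETED |",
    "| ks | tb2 | ERROR |",
    "| ks | tb1 | ERROR |",
]

if __name__ == "__main__":
    print(count_rows_with_status(sample))
```

Running the same count against the output from each datacenter would show whether the ERROR rows are really limited to datacenter two.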

Detailed description:

• What is happening?
On-demand repairs get ERROR status.

• What did you expect to happen?
Repairs to complete successfully.

• What have you tried?

• What version of ecChronos are you using?
Latest version of agent/master branch

• Was the problem detected during an upgrade or downgrade procedure?
No

• Have you checked the ecChronos documents?
Yes.

• What do YOU think is the issue?
There are lock acquisition failures: multiple "Unable to lock resource RepairResource-{node-id}-1" errors appear in the logs for both datacenter1 and datacenter2, but I am not sure whether they are related to the ERROR status.
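
To check whether the lock failures line up with the repairs that end in ERROR, the log lines could be tallied per resource. The sketch below assumes only the message fragment quoted above ("Unable to lock resource RepairResource-...") and makes no assumption about the rest of the log line; the function name `count_lock_failures` and the sample lines and node IDs are hypothetical.

```python
import re
from collections import Counter

# Matches the message fragment quoted in this issue and captures the resource name.
LOCK_FAILURE = re.compile(r"Unable to lock resource (RepairResource-\S+)")


def count_lock_failures(lines):
    """Return a Counter mapping each resource name to its number of lock failures."""
    counts = Counter()
    for line in lines:
        match = LOCK_FAILURE.search(line)
        if match:
            # Drop trailing punctuation or quotes that may follow the name.
            resource = match.group(1).rstrip(".,;:'\"")
            counts[resource] += 1
    return counts


# Hypothetical log lines; the node IDs and prefixes are made up for illustration.
sample_log = [
    "WARN Unable to lock resource RepairResource-aaaa-1",
    "INFO repair session started",
    "WARN Unable to lock resource RepairResource-aaaa-1",
    "WARN Unable to lock resource RepairResource-bbbb-1.",
]

if __name__ == "__main__":
    for resource, n in count_lock_failures(sample_log).most_common():
        print(resource, n)
```

If the nodes with the highest counts are the same nodes whose repairs report ERROR, that would support the lock failures being related; if the counts are similar in both datacenters while ERROR only shows up in datacenter two, they are probably not the whole story.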

• Anything else that is worth being noted?

Full logs will be shared with Victor.

Labels: bug
