Perform database dump anonymisation using a local, in-cluster database

**User Need**

**As a** platform engineer
**I want** the database backup anonymisation process to happen in isolation from the real database
**so that** I can perform the process in production without risking the production database

---

### Context

The current process restores the dump into a fresh database inside the same instance, anonymises it, then swaps and deletes the databases. We don't run it in production, because we don't want that to happen to the production database.

In this story, don't worry about how to make the process run in production, whether in addition to the other environments or exclusively there. Just alter the existing anonymisation process.

Consider looking at how Postgres and MySQL can be configured to speed up database restoration times.
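
As a starting point for that exploration, here is a minimal Python sketch of how a throwaway Postgres instance might be started and restored into with durability traded for speed. It assumes the local database is disposable (these settings are unsafe for any database whose data must survive a crash), that the dump is in Postgres custom or directory format (required for parallel `pg_restore`), and the memory and WAL sizes are illustrative guesses rather than measured values. The function names are hypothetical. MySQL has comparable knobs (for example `innodb_flush_log_at_trx_commit`) that would need the same investigation.

```python
from typing import Dict, List

# Assumption: the local database is disposable, so crash safety is not needed.
# Values for maintenance_work_mem and max_wal_size are illustrative only.
RESTORE_SETTINGS: Dict[str, str] = {
    "fsync": "off",                 # skip flushing writes to disk
    "synchronous_commit": "off",    # do not wait for WAL flush on commit
    "full_page_writes": "off",      # less WAL volume during bulk load
    "maintenance_work_mem": "1GB",  # faster index builds
    "max_wal_size": "4GB",          # fewer checkpoints during the restore
}


def postgres_server_args(settings: Dict[str, str]) -> List[str]:
    """Build a `postgres` command line with one `-c name=value` per setting."""
    args = ["postgres"]
    for name, value in settings.items():
        args += ["-c", f"{name}={value}"]
    return args


def pg_restore_args(dump_path: str, dbname: str, jobs: int = 4) -> List[str]:
    """Build a parallel `pg_restore` command line (custom/directory dumps only)."""
    return [
        "pg_restore",
        f"--jobs={jobs}",
        "--no-owner",
        f"--dbname={dbname}",
        dump_path,
    ]
```

Whether any of these settings gives a worthwhile speed-up would need to be measured against a real dump.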

---

### What’s Needed

- [x] Read the dump file from S3 onto disk in Kubernetes
- [x] Explore ways to speed up the dump process (if reasonable)
- [x] Restore the dump file into a locally running database
- [x] Explore ways to speed up the restore process (again if reasonable)
- [x] Anonymise it
- [x] Dump it back out to the right place
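
The steps above could be sketched as a single pipeline along the following lines. This is an illustration under stated assumptions, not the existing implementation: it assumes Postgres, the AWS CLI being available in the container, and a local database already running. The S3 URIs, the `/work` directory, the `scratch` database name and the `anonymise.sql` script are all hypothetical placeholders.

```python
import subprocess
from typing import Callable, List, Optional


def plan_anonymisation(
    source_uri: str,
    target_uri: str,
    workdir: str = "/work",          # hypothetical scratch volume in the pod
    dbname: str = "scratch",         # hypothetical local database name
    script: str = "anonymise.sql",   # hypothetical anonymisation script
) -> List[List[str]]:
    """Return the commands for fetch, restore, anonymise, dump and upload."""
    raw = f"{workdir}/raw.dump"
    clean = f"{workdir}/anonymised.dump"
    return [
        # 1. Read the dump file from S3 onto local disk.
        ["aws", "s3", "cp", source_uri, raw],
        # 2. Restore it into the locally running database.
        ["pg_restore", "--no-owner", f"--dbname={dbname}", raw],
        # 3. Anonymise it, stopping at the first SQL error.
        ["psql", f"--dbname={dbname}", "-v", "ON_ERROR_STOP=1", f"--file={script}"],
        # 4. Dump the anonymised database back out.
        ["pg_dump", "--format=custom", f"--file={clean}", dbname],
        # 5. Upload the anonymised dump to its destination.
        ["aws", "s3", "cp", clean, target_uri],
    ]


def run_plan(
    plan: List[List[str]],
    runner: Optional[Callable[[List[str]], object]] = None,
) -> None:
    """Run each command in order; the default runner raises on the first failure."""
    if runner is None:
        runner = lambda cmd: subprocess.run(cmd, check=True)
    for command in plan:
        runner(list(command))
```

Separating the plan from its execution keeps the production database out of the picture entirely: every command targets either S3 or the local database, and the plan can be inspected or tested without running anything.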

---

### Acceptance Criteria

- [ ] The anonymisation process happens in isolation from the real database
- [ ] The real databases in staging and integration are replaced with the contents of the anonymised database
- [ ] The production database is **not** affected

---

### Notes

* The new process must function independently of existing applications, containers, processes, etc.