New security group for Postgres access to registration db #1636

JuliaBrigitte · 2026-01-21T18:16:38Z

What does this change?

Added new security group for access to the registration db. This is to eventually replace the VPC default group and increase least privilege database access. Previously any application could reach the database due to their use of the VPC default group¹. Now an application needs to explicitly use the DatabaseAccessSecurityGroup. For ease, an SSM parameter is also created to reference the group.

How to test

The healthcheck has been updated to make a simple database request - if the healthcheck passes then we're able to connect to the database. Therefore, to test, deployment to CODE needs to succeed. To confirm we're only making the database query once, "Performing a query to check DB connectivity" should only appear once, regardless of how often the healthcheck request is made.

Co-authored by: @akash1810

They could be using the group for DB access, and other functionality. ↩

github-actions · 2026-01-22T10:17:23Z

Deploy build 4686 of `mobile-n10n:schedule` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:17:39Z

Deploy build 4696 of `mobile-n10n:football` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:17:49Z

Deploy build 4709 of `mobile-n10n:fakebreakingnewslambda` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:18:04Z

Deploy build 4919 of `mobile-n10n:slomonitor` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:18:09Z

Deploy build 4737 of `mobile-n10n:notification` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:18:54Z

Deploy build 4937 of `mobile-n10n:registration` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:19:55Z

Deploy build 4856 of `mobile-n10n:report` to CODE

All deployment options

From guardian/actions-riff-raff.

github-actions · 2026-01-22T10:20:06Z

Deploy build 5237 of `mobile-n10n:notificationworkerlambda` to CODE

All deployment options

From guardian/actions-riff-raff.

Co-authored-by: Akash <akash1810@users.noreply.github.com>

Co-authored-by: Julia <JuliaBrigitte@users.noreply.github.com>

jacobwinch · 2026-01-22T16:08:33Z

registration/app/registration/controllers/Main.scala

+
+  def healthCheck: Action[AnyContent] = Action.async {
+    // Check if we can talk to the registration database
+    registrar.dbHealthCheck()


Currently I think this is going to make every instance query the DB for every healthcheck request from the load balancer (and also whenever anyone on the public internet sends a request to /healthcheck, which doesn't seem to require authentication). This might not be desirable.

It might also have unintended consequences, for example if the DB becomes unavailable briefly then all instances will cycle at once, which could worsen an outage.

If the goal is to check DB connectivity, can we just check from when an instance launches until connectivity is established and then stop checking?

Alternatively, if this has served its purpose in terms of testing this PR then we could just fallback to the original healthcheck now?

It might also have unintended consequences, for example if the DB becomes unavailable briefly then all instances will cycle at once, which could worsen an outage.

Ah, great point. I think every route hits the database, not sure if that changes things...

If the goal is to check DB connectivity, can we just check from when an instance launches until connectivity is established and then stop checking?

Interesting. As an in memory flag, or as part of the user-data?

Alternatively, if this has served its purpose in terms of testing this PR then we could just fallback to the original healthcheck now?

I think it would be good to have a way to confirm connectivity on PROD, at least once.

Ah, great point. I think every route hits the database, not sure if that changes things...

It's hard to be sure if this is a good idea without doing some testing and/or reviewing the error handling in detail. For example, if we currently serve 500s when we can't talk to the DB and we start serving 503s (due to 0 registered targets), then clients might behave unexpectedly.

It might actually end up being a desirable change in some scenarios (e.g. it might help to protect the DB from being overwhelmed) but I'd be reluctant to change something with potentially wide-ranging consequences when it's not really the goal of this PR.

Interesting. As an in memory flag, or as part of the user-data?

My initial thought would be just making it a lazy val in the class so that we check once when the class is initialised and then cache the value.

I think it would be good to have a way to confirm connectivity on PROD, at least once.

Fair enough; in that case I would probably go with the approach of checking we can connect once before we allow the instance to go healthy. This means that the old instances won't be terminated if something goes wrong, so users are protected.

Fair enough; in that case I would probably go with the approach of checking we can connect once before we allow the instance to go healthy. This means that the old instances won't be terminated if something goes wrong, so users are protected.

Done in d894af9.

To avoid strain on the database, check connectivity only once on instance start (`lazy val`). We should see the log line exactly once per instance, regardless of how often the healthcheck endpoint is queried. Co-authored-by: Julia <JuliaBrigitte@users.noreply.github.com>

akash1810 · 2026-01-26T12:24:16Z

Confirming this deployed to PROD (we have three log lines as we have three instances):

JuliaBrigitte requested a review from a team as a code owner January 21, 2026 18:16

JuliaBrigitte force-pushed the aajb/import-db-yaml-from-mobile-platform branch from 144c117 to 6174a3e Compare January 22, 2026 10:16

JuliaBrigitte marked this pull request as draft January 22, 2026 14:04

akash1810 force-pushed the aajb/import-db-yaml-from-mobile-platform branch 5 times, most recently from dd6fa8a to f286f55 Compare January 22, 2026 15:26

New security group for Postgres access to registration db

a8d0a26

Co-authored-by: Akash <akash1810@users.noreply.github.com>

akash1810 force-pushed the aajb/import-db-yaml-from-mobile-platform branch from f286f55 to 6dc5ffa Compare January 22, 2026 15:34

JuliaBrigitte and others added 2 commits January 22, 2026 15:48

Healthcheck now checks if registration db can be reached.

d189dbc

Co-authored-by: Akash <akash1810@users.noreply.github.com>

feat: Update registrations to use new database access security group

e462e3e

Co-authored-by: Julia <JuliaBrigitte@users.noreply.github.com>

akash1810 force-pushed the aajb/import-db-yaml-from-mobile-platform branch from 6dc5ffa to e462e3e Compare January 22, 2026 15:48

akash1810 approved these changes Jan 22, 2026

View reviewed changes

akash1810 marked this pull request as ready for review January 22, 2026 15:52

jacobwinch reviewed Jan 22, 2026

View reviewed changes

akash1810 force-pushed the aajb/import-db-yaml-from-mobile-platform branch from 3cbbf5c to d894af9 Compare January 26, 2026 10:45

akash1810 approved these changes Jan 26, 2026

View reviewed changes

JuliaBrigitte merged commit 1e156eb into main Jan 26, 2026
9 checks passed

JuliaBrigitte deleted the aajb/import-db-yaml-from-mobile-platform branch January 26, 2026 12:12

New security group for Postgres access to registration db #1636

New security group for Postgres access to registration db #1636

Uh oh!

Conversation

JuliaBrigitte commented Jan 21, 2026 • edited by akash1810 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this change?

How to test

Footnotes

Uh oh!

github-actions bot commented Jan 22, 2026

Uh oh!

github-actions bot commented Jan 22, 2026

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jacobwinch Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

akash1810 Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jacobwinch Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

akash1810 commented Jan 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JuliaBrigitte commented Jan 21, 2026 •

edited by akash1810

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

github-actions bot commented Jan 22, 2026 •

edited

Loading

jacobwinch Jan 22, 2026 •

edited

Loading

akash1810 Jan 22, 2026 •

edited

Loading

jacobwinch Jan 22, 2026 •

edited

Loading