
Conversation

brontolosone
Contributor

Closes getodk/central#1373

What has been done to verify that this works as intended?

Manual testing.

Why is this the best possible solution? Were any other approaches considered?

Batching is used because it gives the user a progress indicator, which is helpful when there is a lot to process and/or their DB is slow. They'll continue to receive visual feedback that nothing is stuck.
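As a sketch, the batched backfill loop looks roughly like this (all names here are illustrative stand-ins and the database call is stubbed out; the real migration invokes a SQL function per batch):

```javascript
// Sketch of the batched backfill pattern (hypothetical, not the PR's code).
// Each iteration caches at most BATCH_SIZE submissions and reports progress,
// so the operator sees movement even on a slow database.
const BATCH_SIZE = 1000;

// Stand-in for the per-batch DB call; here we simulate a backlog of
// 2500 uncached submissions instead of hitting a real database.
let remaining = 2500;
const processBatch = (size) => {
  const processed = Math.min(remaining, size);
  remaining -= processed;
  return processed; // number of rows cached in this batch
};

const progress = [];
const backfill = () => {
  let total = 0;
  for (;;) {
    const processed = processBatch(BATCH_SIZE);
    if (processed === 0) break; // nothing left to cache
    total += processed;
    progress.push(`cached ${total} submissions so far`);
  }
  return total;
};

const total = backfill();
console.log(`done: ${total} submissions cached`);
```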

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

N/A

Does this change require updates to the API documentation? If so, please update docs/api.yaml as part of this PR.

N/A

Before submitting this PR, please make sure you have:

  • run make test and confirmed all checks still pass OR confirm CircleCI build passes
  • verified that any code from external sources is properly credited in comments or that everything is internally sourced

@brontolosone brontolosone force-pushed the 1373_backfill-submission-geo-cache branch from 8cdb999 to d17210a Compare September 28, 2025 09:56
@brontolosone brontolosone force-pushed the 1373_backfill-submission-geo-cache branch from d17210a to 4091c60 Compare September 28, 2025 10:03
@brontolosone brontolosone force-pushed the 1373_backfill-submission-geo-cache branch from 4091c60 to 2790e4d Compare September 28, 2025 14:00
@brontolosone brontolosone marked this pull request as ready for review September 28, 2025 14:05
const path = require('path');


function getSqlFiles(upOrDown) {

what do you think of moving this function into a common file? it is a duplicate of what's in 20250927-01-geoextracts.js



const up = async (db) => {
  const BATCH_SIZE = 1000;

Why do we need to do batching during migration/upgrade-time? Server would be down in any case so I don't see any problem populating the cache in one go.

We can keep the changes in cache_all_submission_geo to support batching; that would be handy down the line when we might want to run the function at run-time.

DROP FUNCTION IF EXISTS "public"."cache_all_submission_geo"() CASCADE;

--- create: cache_all_submission_geo(only_default_path boolean, batchsize integer) ---
CREATE FUNCTION "public"."cache_all_submission_geo"(only_default_path boolean = false, batchsize integer = 9223372036854775807)
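Given that signature, the function could presumably be invoked like this (a sketch of assumed usage, not code from this PR):

```sql
-- Hypothetical calls based on the signature above.
SELECT cache_all_submission_geo();             -- defaults: all paths, effectively unbounded batch
SELECT cache_all_submission_geo(false, 1000);  -- process at most 1000 submissions in this call
```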

can we set the default value of batchsize to NULL? I am thinking about cases where the number of selected rows is greater than 9223372036854775807. From the docs:

LIMIT ALL is the same as omitting the LIMIT clause, as is LIMIT with a NULL argument.
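A minimal illustration of the cited behavior (the table name here is illustrative, not from the migration):

```sql
-- LIMIT NULL behaves like omitting LIMIT entirely, so a NULL default for
-- batchsize would mean "process everything" without a bigint sentinel.
SELECT "id" FROM "submissions" LIMIT NULL;  -- returns all rows, same as no LIMIT clause
```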



Development

Successfully merging this pull request may close these issues: Backfill submission geo cache.