improved TUF artifact replication robustness #7519
Conversation
// This is the equivalent of applying `#[serde(transparent)]`, but that has a
// side effect of changing the JsonSchema derive to no longer emit a schema.
impl Serialize for Generation {
    fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
    where
        S: serde::Serializer,
    {
        self.0.serialize(serializer)
    }
}
I want to call out this change -- I believe there is a bug in progenitor 0.9.x where newtype structs that do not have #[serde(transparent)] cannot be serialized in a query string, and I need to go file an issue for it. In the meantime I think it is more accurate to manually implement Serialize in Omicron. This change does not affect existing JSON serialization because serde_json already treats newtype structs as their inner value.
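For illustration, here is a minimal standalone sketch (using hypothetical Generation stand-ins, not the Omicron type) showing that serde_json produces the same output for a derived newtype struct and for the manual impl:

```rust
use serde::{Serialize, Serializer};

// Hypothetical stand-ins for the real Omicron type, for illustration only.
#[derive(Serialize)]
struct DerivedGeneration(u64);

struct ManualGeneration(u64);

impl Serialize for ManualGeneration {
    fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
    where
        S: Serializer,
    {
        // Delegate to the inner value, like `#[serde(transparent)]` would.
        self.0.serialize(serializer)
    }
}

fn main() {
    // Both forms produce the same JSON: `7`.
    assert_eq!(serde_json::to_string(&DerivedGeneration(7)).unwrap(), "7");
    assert_eq!(serde_json::to_string(&ManualGeneration(7)).unwrap(), "7");
}
```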
I've only taken a look at the docs so far, but looks solid! Thanks for writing that up. I'll finish the review later or tomorrow.
dataset on each M.2 device: the file name of each stored artifact is its
SHA-256 hash.

It also stores an _artifact configuration_ in memory: a list of all
I think this configuration should be stored on disk. Otherwise, if there is a crash or restart of sled-agent or the sled, we have the same problem that drove the use of a generation number in the first place: a Nexus running at a prior generation may attempt to remove packages from a later generation.
In practice I think this means writing the expected artifact configuration first and then using it to check for artifacts that need pruning or that haven't been delivered yet.
When @davepacheco and I were discussing this pattern we weren't convinced that storing the generation number or the entire configuration on disk is worth the work to get it right. The most damning part is that in order to keep the files consistent you would need to use something like an RwLock anyway, which means holding a lock that prevents anything else from happening while performing synchronous I/O operations. By keeping it in memory, the lock is only held for the shortest possible time within code that does not perform I/O.
On the other hand we do use this pattern already for the ledgers; I haven't read that code to see whether we've thought about what to do when either ledger is corrupted or how we maintain consistency with the filesystem.
This is a correctness issue though, right? Without this work the scenario above can cause the same problems.
FWIW, I would still keep the generation in memory. I would just sync it to the ledger when it changes. Then if there is a crash, reload it upon startup.
On the other hand we do use this pattern already for the ledgers; I haven't read that code to see whether we've thought about what to do when either ledger is corrupted or how we maintain consistency with the filesystem.
There is an issue here if an M.2 dies and comes back to life, but other than that I think the ledger abstraction is sound. I wrote up the issue a long time ago when ledgers were initially introduced: #2972 (review)
I've added a ledger that includes only the generation number; this comment explains why I chose not to write the entire config:
omicron/sled-agent/src/artifact_store.rs
Lines 909 to 922 in 41c5d11
/// The format for the `ledger.json` file in the artifact dataset.
///
/// We store only the generation number. This prevents us from using possibly
/// out-of-date information on which artifacts to delete in the delete
/// reconciler; no actions will take place until Nexus explicitly tells us what
/// the current state of the world is.
///
/// We must store the most recent generation number to disk to prevent the case
/// where we start and a Nexus instance with out-of-date information tells us
/// about a configuration.
#[derive(Debug, Deserialize, Serialize)]
struct LedgerFormat {
    generation: Generation,
}
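For context, here is a rough sketch of how a generation-only ledger might be loaded at startup. It uses plain serde_json and tokio::fs rather than the actual Ledger abstraction in omicron-common; the file name, types, and error handling are simplifications:

```rust
use serde::{Deserialize, Serialize};
use std::path::Path;

// Simplified stand-ins; the real types live in sled-agent and omicron-common.
#[derive(Debug, Deserialize, Serialize)]
struct LedgerFormat {
    generation: u64,
}

/// Load the last-committed generation, if any. A missing file just means no
/// configuration has ever been accepted, so we start with `None` and wait for
/// Nexus to tell us the current state of the world.
async fn load_generation(dataset: &Path) -> anyhow::Result<Option<u64>> {
    let path = dataset.join("ledger.json");
    match tokio::fs::read(&path).await {
        Ok(bytes) => {
            let ledger: LedgerFormat = serde_json::from_slice(&bytes)?;
            Ok(Some(ledger.generation))
        }
        Err(e) if e.kind() == std::io::ErrorKind::NotFound => Ok(None),
        Err(e) => Err(e.into()),
    }
}
```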
) -> Result<Inventory> {
) -> Result<(ArtifactConfig, Inventory)> {
    let generation =
        self.datastore.update_tuf_generation_get(opctx).await?;
    let mut inventory = Inventory::default();
    let mut paginator = Paginator::new(SQL_BATCH_SIZE);
    while let Some(p) = paginator.next() {
When we call this, the generation can change out from underneath us, or new artifacts can be added after we have already read the old generation. This stems from the fact that the generation is not coupled to any set of artifacts, so the database doesn't tell you which artifacts a generation is tied to. They are updated independently and read independently. There needs to be some kind of logical mapping exposed in the database.
How do you logically map a generation number to deleted artifacts?
It might also be possible to get the artifact list and the generation number simultaneously via a JOIN or a transaction (I'm not sure, I don't dabble in CRDB consistency much...).
Sorry, this was based on my faulty understanding above where I didn't see that writes were in a transaction. I think you can slap the generation read and pagination in a transaction and this should solve the issue here.
I decided that putting pagination within a transaction seemed arduous. So, I have added a logical mapping with a generation_added column. This ensures that reading the list of artifacts in a generation is consistent even if another generation is added in between pages. The plan is still to delete artifact rows when all the repos referencing them are deleted, but we could change that plan and add a generation_deleted column later instead.
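To illustrate the consistency argument, here is a self-contained sketch (an in-memory stand-in for the database, with made-up field names): the reader pins a generation up front, and artifacts added under a later generation never leak into the middle of a paginated scan:

```rust
#[derive(Clone, Debug)]
struct ArtifactRow {
    id: u64,
    generation_added: u64,
}

/// One "page" of a paginated read: rows with id > cursor, limited to
/// `page_size`, restricted to the generation pinned at the start of the scan.
fn read_page(rows: &[ArtifactRow], pinned_gen: u64, cursor: u64, page_size: usize) -> Vec<ArtifactRow> {
    rows.iter()
        .filter(|r| r.generation_added <= pinned_gen && r.id > cursor)
        .take(page_size)
        .cloned()
        .collect()
}

fn main() {
    let mut rows = vec![
        ArtifactRow { id: 1, generation_added: 3 },
        ArtifactRow { id: 2, generation_added: 3 },
    ];
    let pinned_gen = 3;

    // First page.
    let page1 = read_page(&rows, pinned_gen, 0, 1);

    // A new repo lands between pages, adding an artifact at generation 4.
    rows.push(ArtifactRow { id: 3, generation_added: 4 });

    // Second page: the generation-4 artifact is filtered out, so the scan
    // still describes exactly the artifacts known at generation 3.
    let page2 = read_page(&rows, pinned_gen, page1.last().unwrap().id, 10);
    assert_eq!(page2.iter().map(|r| r.id).collect::<Vec<_>>(), vec![2]);
}
```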
I could use an extra (few) sets of eyes on the modified implementation of DataStore::update_tuf_repo_insert (more specifically the insert_impl function). In particular, we start the transaction by fetching the current generation, computing the new generation, and filling in the generation_added field on all the artifacts:
omicron/nexus/db-queries/src/db/datastore/update.rs
Lines 181 to 187 in 41c5d11
// Load the current generation from the database and increment it, then
// use that when creating the `TufRepoDescription`. If we determine there
// are any artifacts to be inserted, we update the generation to this value
// later.
let old_generation = get_generation(&conn).await?;
let new_generation = old_generation.next();
let desc = TufRepoDescription::from_external(desc.clone(), new_generation);
Then, if we determine new artifacts are to be inserted, we write the new generation number:
omicron/nexus/db-queries/src/db/datastore/update.rs
Lines 311 to 325 in 41c5d11
if !new_artifacts.is_empty() {
    // Since we are inserting new artifacts, we need to bump the
    // generation number.
    debug!(log, "setting new TUF repo generation";
        "generation" => new_generation,
    );
    put_generation(&conn, old_generation.into(), new_generation.into())
        .await?;

    // Insert new artifacts into the database.
    diesel::insert_into(dsl::tuf_artifact)
        .values(new_artifacts)
        .execute_async(&conn)
        .await?;
}
This only updates the generation if it currently equals the old generation, and returns an error if no rows were updated:
omicron/nexus/db-queries/src/db/datastore/update.rs
Lines 373 to 389 in 41c5d11
async fn put_generation(
    conn: &async_bb8_diesel::Connection<crate::db::DbConnection>,
    old_generation: nexus_db_model::Generation,
    new_generation: nexus_db_model::Generation,
) -> Result<nexus_db_model::Generation, DieselError> {
    use db::schema::tuf_generation::dsl;

    // We use `get_result_async` instead of `execute_async` to check that we
    // updated exactly one row.
    diesel::update(dsl::tuf_generation.filter(
        dsl::singleton.eq(true).and(dsl::generation.eq(old_generation)),
    ))
    .set(dsl::generation.eq(new_generation))
    .returning(dsl::generation)
    .get_result_async(conn)
    .await
}
I'm not 100% sure this is the right way to do this; if the generation number is incremented by another transaction first, I would prefer to retry this transaction than return an unretryable error. I don't understand enough about whether CockroachDB would detect this as a transaction conflict and tell us to retry it.
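To sketch the retry behavior I would prefer (purely hypothetical, with made-up names; not how the datastore currently behaves), the zero-rows-updated case could be surfaced as a distinct conflict and the whole transaction re-run a bounded number of times:

```rust
/// Hypothetical caller-side retry for the "another writer bumped the
/// generation first" case; not the current datastore behavior.
#[derive(Debug)]
enum InsertError {
    /// `put_generation` matched zero rows: the compare-and-swap on the
    /// generation lost a race. Safe to retry the whole transaction.
    GenerationConflict,
    /// Anything else is surfaced to the caller.
    Other(String),
}

fn insert_with_retry(
    mut attempt: impl FnMut() -> Result<(), InsertError>,
    max_attempts: usize,
) -> Result<(), InsertError> {
    for _ in 0..max_attempts {
        match attempt() {
            Ok(()) => return Ok(()),
            // Re-run the transaction; it re-reads the now-current generation
            // and picks the next one.
            Err(InsertError::GenerationConflict) => continue,
            Err(e) => return Err(e),
        }
    }
    Err(InsertError::GenerationConflict)
}
```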
I like this strategy a lot and think it should work with the serializable constraints of the DB. Thanks for adding this support!
I could use an extra (few) sets of eyes looking at the modified implementation of DataStore::update_tuf_repo_insert (more specifically the insert_impl function). In particular, we start the transaction by fetching the current generation and selecting the new generation, and filling in the generation_added field in all the artifacts:
I took a pretty close look and it looks great AFAICT.
I'm not 100% sure this is the right way to do this; if the generation number is incremented by another transaction first, I would prefer to retry this transaction than return an unretryable error. I don't understand enough about whether CockroachDB would detect this as a transaction conflict and tell us to retry it.
I don't think CRDB will retry for us, but I'm not really sure. Would it be worth adding a test for this?
This most recent push only includes a merge from main and fixes for some of the docs nits; I'm going to be working on writing the generation number out to a ledger on the filesystem and making the artifact list query more consistent.
sled-agent/src/artifact_store.rs
Outdated
// These paths are defined under the artifact storage dataset. They
// cannot conflict with any artifact paths because all artifact paths are
// hexadecimal-encoded SHA-256 checksums.
const LEDGER_PATH: &str = "ledger.json";
Even though this lives in the update dataset, it may be helpful to give this a more specific name like artifacts_ledger.json
sled-agent/src/artifact_store.rs
Outdated
@@ -697,8 +906,35 @@ impl From<Error> for HttpError {
}
}

/// The format for the `ledger.json` file in the artifact dataset.
///
/// We store only the generation number. This prevents us from using possibly
What's the concern around out-of-date artifacts during delete reconciliation?
The key safety property for reconciliation should be that updates cannot go backwards. If you always write the new configuration before you use it, you can probably avoid an issue of staleness.
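As a sketch of that "never go backwards" property (hypothetical, simplified types; not the sled-agent implementation): a new configuration is rejected unless its generation is strictly newer than the one already committed:

```rust
// Hypothetical, simplified acceptance check for illustration only.
#[derive(Clone)]
struct ArtifactConfig {
    generation: u64,
    artifacts: Vec<String>, // SHA-256 hex digests
}

enum PutError {
    GenerationNotNewer { current: u64, offered: u64 },
}

fn check_monotonic(current: Option<&ArtifactConfig>, new: &ArtifactConfig) -> Result<(), PutError> {
    match current {
        // Reject anything at or below the generation we already committed, so
        // a Nexus holding stale state cannot roll the configuration back.
        // (An equal generation with an identical artifact list could instead
        // be treated as an idempotent no-op.)
        Some(cur) if new.generation <= cur.generation => Err(PutError::GenerationNotNewer {
            current: cur.generation,
            offered: new.generation,
        }),
        _ => Ok(()),
    }
}
```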
sled-agent/src/artifact_store.rs
Outdated
    // the guard is dropped.
    config: watch::Sender<Option<ArtifactConfig>>,
    // This mutex must only be unlocked in `put_config`.
    ledger: Mutex<Ledger<LedgerFormat>>,
I don't think it's a good idea to use a tokio::Mutex here. We've been trying to get away from them for the reasons described in RFD 397 and RFD 400.
#7347 is doing something very similar to this code, and after lots of review discussion we ended up using a tokio task to serialize requests instead, as suggested in RFD 400. That would probably work well here as well.
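For reference, here is a rough sketch of that pattern (all names made up): a dedicated tokio task owns the ledger state and serializes configuration updates arriving over an mpsc channel, so no async mutex is held across I/O:

```rust
use tokio::sync::{mpsc, oneshot, watch};

// Hypothetical message type; the real code would carry an ArtifactConfig.
struct PutConfig {
    generation: u64,
    reply: oneshot::Sender<Result<(), String>>,
}

/// The task owns the persisted generation outright; requests are processed
/// one at a time in arrival order, so no lock is needed.
async fn config_task(
    mut rx: mpsc::Receiver<PutConfig>,
    current: watch::Sender<Option<u64>>,
) {
    let mut persisted: Option<u64> = None;
    while let Some(req) = rx.recv().await {
        let result = if persisted.is_some_and(|g| req.generation <= g) {
            Err(format!("generation {} is not newer", req.generation))
        } else {
            // Persist to the ledger here (omitted), *then* publish in memory.
            persisted = Some(req.generation);
            current.send_replace(Some(req.generation));
            Ok(())
        };
        let _ = req.reply.send(result);
    }
}
```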
The mutex is held over an await, so I ended up opting for it. But I can swap it out for another task.
sled-agent/src/artifact_store.rs
Outdated
) -> Result<(), Error> {
    // A lock on the ledger Mutex is held until the end of this function.
    // Only one request to put a new configuration is processed at a time.
    let mut ledger = self.ledger.lock().await;
If this lock goes away in favor of a task, you can modify the ledger and then update the watch inside the task. Since the only place the generation can be updated is inside the task, you also likely don't need to use send_if_modified and can instead write the ledger if the generation changes, then unconditionally set the watch channel value.
This last part seems pretty important. You don't want to set the value in the watch channel before committing the ledger, as that leads to a race condition where the sled-agent can report the current configuration to Nexus and then restart with the old configuration. While this may be safe with the current Nexus code and retries, in my experience it's almost always better to persist the consistent state before changing the in-memory value. We discussed this in the context of omicron zones in #5086. In fact, now that John has merged #7762, he'll be fixing that issue soon.
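A tiny sketch of the ordering being recommended here (hypothetical helper, simplified error handling): commit the ledger to disk first, and only then make the new value visible through the watch channel:

```rust
use std::path::Path;
use tokio::sync::watch;

/// Persist the new generation, then publish it. If the write fails, the
/// in-memory value is left untouched, so we never report state that we
/// could lose across a restart.
async fn commit_then_publish(
    ledger_path: &Path,
    new_generation: u64,
    current: &watch::Sender<Option<u64>>,
) -> std::io::Result<()> {
    // 1. Durably record the new generation (a real implementation would
    //    write both M.2 copies).
    tokio::fs::write(ledger_path, new_generation.to_string()).await?;
    // 2. Only now update what we report to Nexus and to other tasks.
    current.send_replace(Some(new_generation));
    Ok(())
}
```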
    }
    return ControlFlow::Continue(BTreeMap::new());
}
It's possible that the sled-agent crashes here. If we don't persist the artifact list with the generation, then I believe we'll get back an Error::NoConfig upon restart, assuming everything on the sled-agent side comes back in time. In that case we'll wait for the next call to the background task. This seems safe, but also a bit unnecessary if this race did occur. Instead, the ledger could be read and the artifact list loaded directly from the file, if the ledger was always written before the watch config was updated.
Closes #7399.
Nexus now owns and maintains a generation number for the set of artifacts the system wants to be fully replicated, which is used by Sled Agent to prevent conflicts. The generation number is stored in a new singleton table based on the existing db_metadata singleton. I wrote up docs/tuf-artifact-replication.adoc to provide a top-level overview of the system and some of the conflicts that this refactor seeks to prevent.

The Sled Agent artifact store APIs are modified. Two new APIs exist for getting and putting an "artifact configuration", which is the list of wanted artifacts and its associated generation number. The list request returns the current generation number as well, and the PUT and "copy from depot" requests require an up-to-date generation number in the query string. The delete API is removed in favor of Sled Agent managing deletions on its own whenever the configuration is updated.
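As a rough illustration of the new API surface (field and parameter names are guesses, not necessarily the actual Omicron definitions): the artifact configuration pairs the generation with the set of wanted artifact hashes, and write operations name the generation they expect:

```rust
use std::collections::BTreeSet;

use serde::{Deserialize, Serialize};

// Rough sketch only; the real types live in the sled-agent API crates.
#[derive(Clone, Debug, Deserialize, Serialize)]
struct ArtifactConfig {
    /// Generation owned and incremented by Nexus.
    generation: u64,
    /// SHA-256 digests (hex) of every artifact that should be replicated.
    artifacts: BTreeSet<String>,
}

/// Query parameters for artifact PUT and "copy from depot" requests: the
/// caller names the generation it believes is current, and the request is
/// rejected if the sled's configuration has moved on.
#[derive(Debug, Deserialize)]
struct ArtifactPutQuery {
    generation: u64,
}
```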