Flush RRD only when TXGs contain data by oshogbo · Pull Request #18138 · openzfs/zfs

oshogbo · 2026-01-16T10:49:08Z

Description

This change modifies the behavior of spa_sync_time_logger when flushing the RRD database.

Previously, once the sync interval elapsed, a flush would always be generated. On solid-state devices, especially when the pool was otherwise idle, this caused disks to wake up solely to write RRD data. Since RRD is best-effort telemetry, this behavior is unnecessary and wasteful.

With this change, spa_sync_time_logger delays flushing until a TXG that already contains data is being synced. The RRD update is appended to that TXG instead of forcing the creation of a new write-only TXG.

During pool export, flushing is forced regardless of whether the TXG contains user data. At that stage, data durability takes precedence and a write must be issued.

This fixes #18082
This change was inspired from @amotin in comments #18120.

Sponsored by: [Wasabi Technology, Inc.; Klara, Inc.]

How Has This Been Tested?

I have added logs to check when the database is flushed and what is the size of database.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Performance enhancement (non-breaking change which improves efficiency)
Code cleanup (non-breaking change which makes code smaller or more readable)
Quality assurance (non-breaking change which makes the code more robust against bugs)
Breaking change (fix or feature that would cause existing functionality to change)
Library ABI change (libzfs, libzfs_core, libnvpair, libuutil and libzfsbootenv)
Documentation (a change to man pages or other documentation)

Checklist:

My code follows the OpenZFS code style requirements.
I have updated the documentation accordingly.
I have read the contributing document.
I have added tests to cover my changes.
I have run the ZFS Test Suite with this change applied.
All commit messages are properly formatted and contain Signed-off-by.

amotin

I am not 100% sure dp_dirty_pertxg at this point reliably means there is nothing to be written in this TXG. It may need a deeper look. But yea, this might be the direction.

module/zfs/spa.c

amotin

I don't have other objections, but I still worry about already mentioned dp_dirty_pertxg. For example, will snapshot creation or something similar, working in sync context, trigger the history update?

This change modifies the behavior of spa_sync_time_logger when flushing the RRD database. Previously, once the sync interval elapsed, a flush would always be generated. On solid-state devices, especially when the pool was otherwise idle, this caused disks to wake up solely to write RRD data. Since RRD is best-effort telemetry, this behavior is unnecessary and wasteful. With this change, spa_sync_time_logger delays flushing until a TXG that already contains data is being synced. The RRD update is appended to that TXG instead of forcing the creation of a new write-only TXG. During pool export, flushing is forced regardless of whether the TXG contains user data. At that stage, data durability takes precedence and a write must be issued. Sponsored by: [Wasabi Technology, Inc.; Klara, Inc.] Signed-off-by: Mariusz Zaborski <mariusz.zaborski@klarasystems.com>

behlendorf

dp_dirty_pertxg is set by dsl_pool_dirty_space() so this should be a reasonable way to quickly check for any dirty data associated with the txg. I believe you're right, it won't account for anything dirtied in syncing context but since this best-effort I don't think that need to hold this up.

amotin · 2026-02-06T21:59:56Z

I think a better way to do it would be to call spa_sync_time_logger() in spa_sync_iterate_to_convergence() between syncing datasets and syncing MOS (now both are inside dsl_pool_sync()). I.e. we need to update the database each time we need to sync dirty MOS after doing everything else. I actually envision slightly bigger refactoring there, since I think it would be beneficial to sync BRT and DDT (and may be doing something else) before syncing MOS to reduce the number of sync iterations. But I need to look on it on a fresh head.

behlendorf · 2026-02-09T19:50:33Z

Agreed, that would be better. @oshogbo can you look at reworking this.

behlendorf · 2026-02-11T19:21:22Z

Actually, let me go ahead and merged this fix as is. It's been tested and resolves the core issue for now. We can refactor is as suggested by @amotin in a future PR to further improve things.

This change modifies the behavior of spa_sync_time_logger when flushing the RRD database. Previously, once the sync interval elapsed, a flush would always be generated. On solid-state devices, especially when the pool was otherwise idle, this caused disks to wake up solely to write RRD data. Since RRD is best-effort telemetry, this behavior is unnecessary and wasteful. With this change, spa_sync_time_logger delays flushing until a TXG that already contains data is being synced. The RRD update is appended to that TXG instead of forcing the creation of a new write-only TXG. During pool export, flushing is forced regardless of whether the TXG contains user data. At that stage, data durability takes precedence and a write must be issued. Sponsored by: [Wasabi Technology, Inc.; Klara, Inc.] Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: Mariusz Zaborski <mariusz.zaborski@klarasystems.com> Closes openzfs#18082 Closes openzfs#18138

oshogbo force-pushed the oshogbo/flush_bad branch from 7382076 to dca9998 Compare January 16, 2026 10:50

oshogbo mentioned this pull request Jan 16, 2026

[2.4] TXG timestamp DB sync if idle causes unnecessary disk access/prevent spin down #18082

Closed

amotin requested changes Jan 16, 2026

View reviewed changes

module/zfs/spa.c Outdated Show resolved Hide resolved

module/zfs/spa.c Outdated Show resolved Hide resolved

Bronek mentioned this pull request Jan 17, 2026

Fix unnecessary writes of transaction database #18120

Closed

14 tasks

oshogbo force-pushed the oshogbo/flush_bad branch from dca9998 to eda07a7 Compare January 22, 2026 15:42

oshogbo force-pushed the oshogbo/flush_bad branch from eda07a7 to ff8f278 Compare January 30, 2026 16:44

amotin reviewed Feb 2, 2026

View reviewed changes

behlendorf force-pushed the oshogbo/flush_bad branch from ff8f278 to 8b54145 Compare February 5, 2026 02:04

behlendorf self-requested a review February 5, 2026 02:05

behlendorf approved these changes Feb 6, 2026

View reviewed changes

behlendorf added the Status: Accepted Ready to integrate (reviewed, tested) label Feb 6, 2026

behlendorf requested a review from amotin February 6, 2026 18:13

behlendorf added Status: Revision Needed Changes are required for the PR to be accepted and removed Status: Accepted Ready to integrate (reviewed, tested) labels Feb 9, 2026

behlendorf added Status: Accepted Ready to integrate (reviewed, tested) and removed Status: Revision Needed Changes are required for the PR to be accepted labels Feb 11, 2026

behlendorf merged commit cdf89f4 into openzfs:master Feb 11, 2026
38 of 41 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flush RRD only when TXGs contain data#18138

Flush RRD only when TXGs contain data#18138
behlendorf merged 1 commit intoopenzfs:masterfrom
oshogbo:oshogbo/flush_bad

oshogbo commented Jan 16, 2026 •

edited

Loading

Uh oh!

amotin left a comment

Uh oh!

Uh oh!

Uh oh!

amotin left a comment

Uh oh!

behlendorf left a comment •

edited

Loading

Uh oh!

amotin commented Feb 6, 2026

Uh oh!

behlendorf commented Feb 9, 2026

Uh oh!

behlendorf commented Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

oshogbo commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

How Has This Been Tested?

Types of changes

Checklist:

Uh oh!

amotin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

amotin left a comment

Choose a reason for hiding this comment

Uh oh!

behlendorf left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

amotin commented Feb 6, 2026

Uh oh!

behlendorf commented Feb 9, 2026

Uh oh!

behlendorf commented Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

oshogbo commented Jan 16, 2026 •

edited

Loading

behlendorf left a comment •

edited

Loading