Skip to content

prudent-rs/ndd

GitHub Actions results

Summary

Purpose

ndd (Non-De-Duplicated) is a zero-cost transparent wrapper for static variables that do not share memory with any other static or const (or local) variables (or literals). Use for static data (single variables/arrays/slices) referenced with references/slices/pointers that are compared by address.

Use

Use ndd::NonDeDuplicated to wrap your static data (other than string literals (&str) or C string literal bytes). Use it for (immutable) static variables only.

Use ndd::NonDeDuplicatedStr and ndd::NonDeDuplicatedCStr to wrap static string slices (&str) and C strings (owned bytes that defer to &CStr). These two types need a const generic parameter N, which is the length (in bytes). There is no way around this (on stable Rust). On nightly Rust you can use ndd::infer:NonDeDuplicatedStr and ndd::infer:NonDeDuplicatedCStr from odd-numbered (-nightly) version of ndd instead.

See unit tests in src/lib.rs, and demo_fix/callee/src/lib.rs.

Problem

Rust (or, rather, LLVM) by default de-duplicates or reuses addresses of static variables in release builds. And somewhat in dev (debug) builds, too. For most purposes that is good: The result binary is smaller, and because of more successful cache hits, the execution is faster.

However, that is counter-productive when the code identifies/compares static data by memory address of the reference (whether a Rust reference/slice, a pointer/pointer range, or the pointer casted to usize). For example, an existing Rust/3rd party API may accept ("ordinary") references/slices. You may want to extend that API's protocol/behavior with signalling/special handling when the client sends in your designated static variable by reference/slice/pointer/pointer range. (Your special handler may cast such references/slices to pointers and compare them by address with core::ptr::eq or core::ptr::addr_eq.)

Then you do not want the client, nor the compiler/LLVM, to reuse/share the memory address of such a designated static for any other ("ordinary") static or const values/expressions, or local numerical/character/byte/string/c-string slice literals. Otherwise an "ordinary" invocation of the API could trigger your designated signalling unintentionally.

That does work out of the box when the client passes a reference/slice defined as static: each static gets its own memory space (even with the default release optimizations). See a test src/lib.rs -> tests_without_ndd -> addresses_unique_between_statics().

However, there is a problem (caused by de-duplication in release builds, and for some types even in dev or MIRI). It affects ("ordinary") const values/expressions that equal in value to any static (whether it's a static variable, or a static literal), which may be your designated static. Rust/LLVM re-uses address of one such matching static for references to any equal value(s) defined as const. See src/lib.rs -> tests_without_ndd -> u8_global_const_and_global_static_release(). Such const, static or literal could be in 3rd party code, even private. (See demo_bug/.)

Things get worse: dev builds don't have this consistent:

  • For some types (u8, numeric primitive-based enums) dev builds don't reuse static addresses for references/slices to const values. But
  • For other types (str), dev builds do reuse them...

MIRI reuses static addresses even less (than dev does), but it still does reuse them sometimes

  • for example, between byte (&CStr) literals (b"Hello") and equal string (&str) literals (technically, subslices: "Hello").

Even worse so: release builds don't have this consistent. De-duplication across crates depends on "fat" link time optimization (LTO):

[profile.release]
lto = "fat"

For dev builds cross-crate de-duplication depends on "fat" link time optimization (LTO) AND opt-level being 2 or higher:

[profile.dev]
lto = "fat"
opt-level = 2

Solution

ndd::NonDeDuplicated uses core::cell::Cell to hold the data passed in by the user. There is no mutation and no mutation access. The only access it gives to the inner data is through shared references.

Unlike core::cell::Cell (and friends), NonDeDuplicated does implement core::marker::Sync (if the inner data's type implements core::marker::Send and core::marker::Sync). It can safely do so, because it never provides mutable access, and it never mutates the inner data. That is similar to how std::sync::Mutex implements core::marker::Sync, too.

See src/lib.rs -> tests_with_ndd.

Compatibility

ndd is no_std-compatible and it doesn't need heap (alloc) either. Release versions (even-numbered major versions, and not -nightly pre-releases) compile with stable Rust. (More below.)

Do not use it for locals or on heap. That is validated by implementation of core::ops::Drop, which panics in dev builds.

Always forward compatible

ndd is planned to be always below version 1.0. So stable (even-numbered) versions will be forward compatible. (If a need ever arises for big incompatibility, that can go in a new crate.)

That allows you to specify ndd as a dependency with version 0.*, which will match ANY major versions (below 1.0, of course). That will match the newest (even-numbered major) stable version (available for your Rust) automatically.

This is special only to 0.* - it is not possible to have a wildcard matching various major versions 1.0 or higher.

Versioning schema

  • Even-numbered major versions (0.2, 0.4...)

    • are for stable functionality only.
    • don't use any pre-release identifier (so, nothing like 0.4-alpha).
    • here they are called "stable", but the version name/identifier doesn't include "stable" word.
  • Odd-numbered major versions (0.3, 0.5...)

    • always contain -nightly (pre-release identifier) in their name.

    • are, indeed, for nightly (unstable) functionality, and need nightly Rust toolchain (indicated with rust-toolchain.toml which is present on nightly GIT branch only).

    • include functionality already present in some lower stable versions. Not all of them - only:

      • stable versions with a lower major numeric version, and
      • if the stable major version is lower by 0.1 only (and not by more), then the stable minor version has to be the same or lower (than minor version of the odd-numbered (-nightly)).

      So if x < z

      • 0.x.y (stable)
      • 0.z.y-nightly is (indeed) nightly
        • 0.z.y-nightly includes all functionality already present in 0.x.y (stable).
      • But, if x + 0.1 == z and y < w
        • 0.z.y-nightly does not include any functionality new in 0.x.w (stable), because it was not present in 0.x.y yet).

      Examples:

      • 0.2.1 (stable)
      • 0.3.1-nightly
        • 0.3.1-nightly includes functionality present in 0.2.1 (stable).
      • 0.2.2 (stable)
      • 0.3.2-nightly
        • 0.3.2-nightly includes functionality present in 0.2.2 (if they get published), BUT:
      • 0.2.1 (stable)
      • 0.3.1-nightly
        • 0.3.1-nightly will not include functionality present in 0.2.2 that was not present in 0.2.1.
  • If needed and if practical, new major versions will use the SemVer trick. See also The Cargo Book > Dependency Resolution.

    However, the only types exported from ndd is ndd::NonDeDuplicated, ndd::NonDeDuplicatedStr and ndd::NonDeDuplicatedCStr. They are zero-cost wrappers suitable for immutable static variables. They are normally not being passed around as a parameter/return type or a composite type. And their functions can get inlined/optimized away. So, there shouldn't be any big binary size/speed difference, or usability difference, if there happen to be multiple major versions of ndd crate in use at the same time. They would be all isolated. So SemVer trick may be unnecessary.

Crate version is validated by GIT pre-commit and by GitHub Actions.

Rule of thumb for stable versions

On stable Rust, always specify ndd with version 0.*. Then, automatically:

  • you will get the newest available even-numbered major (stable) version, and
  • your libraries will work with any newer odd-numbered major (-nightly) version of ndd, too, if any dependency (direct or transitive) requires it.

Rule of thumb for unstable versions

To find out the highest even-numbered (stable) version whose functionality is included in a given odd-numbered (-nightly) version, decrement the odd-numbered version by 0.1 (and remove the -nightly suffix).

Nightly versioning

We prefer not to introduce temporary cargo features. Removing a feature later is a breaking change. And we don't want just to make such a feature no-op and let it sit around.

So, instead, any nightly-only functionality has separate versions that always

As per Rust resolver rules, a stable (non-pre-release) version will NOT match/auto-update to a pre-release version on its own. Therefore, if your crate and/or its dependencies specify ndd version as 0.*, they will not accidentally request an odd-numbered (-nightly) major on their own.

They can get a (-nightly) version, but only if another crate requires it. That's up to the consumer.

If you want more control over stable versions, you can fix the even-numbered major version, and use an asterisk mask for the minor version, like 0.2.*. But then you lose automatic major updates.

Nightly functionality

WARNING: Functionality of odd-numbered major (-nightly) versions is always subject to change!

The following extra functionality is available on 0.3.5-nightly. Of course, you need nightly Rust toolchain.

as_array_of_cells

ndd::NonDeDuplicated has function as_array_of_cells, similar to Rust's core::cell::Cell::as_array_of_cells (which will, hopefully, become stable in Rust 1.91).

as_slice_of_cells

Similar to as_array_of_cells, ndd::NonDeDuplicated has function as_slice_of_cells. That can be stable with with Rust 1.88+. However, to simplify versioning, it's bundled in -nightly together with as_array_of_cells. and may become stable at the same time. If you need it earlier, get in touch.

const Deref and From

core::ops::Deref and core::convert::From are implemented as const. As of mid 2025, const traits are having high traction in Rust. Hopefully this will be stable not in years, but sooner.

These traits are not implemented in stable versions at all. Why? Because ndd types are intended for static variables, so non-const functions don't help us.

Quality assurance

Checks and tests are run by GitHub Actions. See results. All scripts run on Alpine Linux (without libc, in a rust:1.87-alpine container) and are POSIX-compliant:

Use cases

Used by hash-injector::signal.

Updates

Please subscribe for low frequency updates at peter-lyons-kehl/ndd/issues#2.

Side fruit

The following side fruit is std-only, but related: std::sync::mutex::data_ptr(&self) is now a const function: pull request rust-lang/rust#146904.

About

Non-de-duplicated read-only Rust cell. For static with unique addresses.

Resources

License

Apache-2.0 and 2 other licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
BSD-2-Clause
LICENSE-BSD
MIT
LICENSE-MIT

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published