RFC: CryptoVec mlock behavior - should it be secure by default or opt-in?

  CryptoVec currently mlocks all allocated memory to prevent swapping sensitive data to disk. While secure, this has significant performance overhead for high-throughput use cases like SFTP file transfers.

  Profiling shows mlock/munlock syscalls using a lot of CPU time in one of my projects (25%+) when most of the data flowing through buffers isn't actually cryptographic secrets - it's packet payloads, channel data (stdin/stdout), etc.

  ## OpenSSH's approach

  OpenSSH's `sshbuf` uses non-mlocked buffers by default:
  - `sshbuf_new()` at [packet.c:245-248](https://github.com/openssh/openssh-portable/blob/master/packet.c#L245-L248) creates
  buffers for `incoming_packet`, `outgoing_packet`, etc. **without** mlock
  - Channel data at [channels.c:3479](https://github.com/openssh/openssh-portable/blob/master/channels.c#L3479) uses regular
  buffers
  - Sensitive data (private keys, shared secrets) gets explicit secure handling via `freezero()`
  ([misc.c](https://github.com/openssh/openssh-portable/blob/master/openbsd-compat/freezero.c))

  ## Proposed changes

  ### 1. Consistent zeroization

  The `resize()` truncation path uses [`memset` at cryptovec.rs:216](https://github.com/Eugeny/russh/blob/main/cryptovec/src/cryptovec.rs#L216) which could be optimized away by the compiler. However, reallocation uses [`zeroize` at line 230](https://github.com/Eugeny/russh/blob/main/cryptovec/src/cryptovec.rs#L230) and Drop uses [`zeroize` at line 367](https://github.com/Eugeny/russh/blob/main/cryptovec/src/cryptovec.rs#L367), both with the [optimization barrier at line 383](https://github.com/Eugeny/russh/blob/main/cryptovec/src/cryptovec.rs#L383).

  Should use `zeroize()` consistently in all paths, matching:
  - OpenSSH's [explicit_bzero](https://github.com/openssh/openssh-portable/blob/master/openbsd-compat/explicit_bzero.c) pattern
  - RustCrypto's [optimization_barrier approach](https://github.com/RustCrypto/utils/pull/1252) (see also [issue
  #1269](https://github.com/RustCrypto/utils/issues/1269))

  ### 2. Conditional mlock with categorized buffers

  Add a `secure: bool` field to CryptoVec and categorize buffers:

   ### Should be mlocked (secrets)

  - **Shared secrets (K)** - passed to `kex_derive_keys()`, cleared with `explicit_bzero` after use
  ([kex.c:1123-1145](https://github.com/openssh/openssh-portable/blob/master/kex.c#L1123-L1145))
  - **Derived session keys** - `newkeys->enc.key`, `newkeys->mac.key` cleared with `explicit_bzero`, struct freed with `freezero`   ([kex.c:690-711](https://github.com/openssh/openssh-portable/blob/master/kex.c#L690-L711))
  - **Exchange hash working buffer** - contains K during derivation via `ssh_digest_update_buffer(hashctx, shared_secret)`
  ([kex.c:1078-1097](https://github.com/openssh/openssh-portable/blob/master/kex.c#L1078-L1097))
  - **Agent lock password** - `lock_pwhash` cleared with `explicit_bzero` after use
  ([ssh-agent.c:1474-1485](https://github.com/openssh/openssh-portable/blob/master/ssh-agent.c#L1474-L1485))

  ### Should NOT be mlocked (not secrets)

  - **Packet I/O buffers** - `incoming_packet`, `outgoing_packet` created with `sshbuf_new()`, no mlock
  ([packet.c:245-248](https://github.com/openssh/openssh-portable/blob/master/packet.c#L245-L248))
  - **Channel data** - `c->input`, `c->output`, `c->extended` created with `sshbuf_new()`, no mlock
  ([channels.c:540-542](https://github.com/openssh/openssh-portable/blob/master/channels.c#L540-L542))
  - **Decompression buffer** - `compression_buffer` created with `sshbuf_new()`, no mlock
  ([packet.c:800-801](https://github.com/openssh/openssh-portable/blob/master/packet.c#L800-L801))
  - **Public keys** - regular `sshbuf` allocation, `sshbuf_free` uses `freezero` for zeroization but no mlock
  ([sshbuf.c:191-192](https://github.com/openssh/openssh-portable/blob/master/sshbuf.c#L191-L192))

  Note: OpenSSH's `sshbuf_new()` ([sshbuf.c:93](https://github.com/openssh/openssh-portable/blob/master/sshbuf.c#L93)) does not
  mlock, but `sshbuf_free()` ([sshbuf.c:191](https://github.com/openssh/openssh-portable/blob/master/sshbuf.c#L191)) uses
  `freezero()` to zero memory. The model is: zero everything on free, but only mlock actual secrets.

  ## Question: What should the API default be?

  **Option A: Secure by default**

  ```rust
  CryptoVec::new()                  // mlocked
  CryptoVec::new_unlocked()         // not mlocked, for non-secrets
  CryptoVec::from_slice_unlocked()  // not mlocked
  CryptoVec::from_vec_unlocked()    // not mlocked
```
  - Safer for library consumers who may not think about security
  - External code using CryptoVec would need to opt-out for performance

  Option B: Performance by default (matches OpenSSH)
```rust
  CryptoVec::new()           // not mlocked (like sshbuf_new())
  CryptoVec::new_secure()    // mlocked, for secrets
  CryptoVec::from_slice_secure()
  CryptoVec::from_vec_secure()
```
  - Matches OpenSSH's model
  - External code would need to opt-in for security




Which approach do you prefer? I'm happy to implement either way.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RFC: CryptoVec mlock behavior - should it be secure by default or opt-in? #629

OpenSSH's approach

Proposed changes

1. Consistent zeroization

2. Conditional mlock with categorized buffers

Should be mlocked (secrets)

Should NOT be mlocked (not secrets)

Question: What should the API default be?

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

RFC: CryptoVec mlock behavior - should it be secure by default or opt-in? #629

Description

OpenSSH's approach

Proposed changes

1. Consistent zeroization

2. Conditional mlock with categorized buffers

Should be mlocked (secrets)

Should NOT be mlocked (not secrets)

Question: What should the API default be?

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions