DAB-LABS
diff --git a/CHANGELOG.md b/CHANGELOG.md
Lines changed: 23 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 2 additions & 0 deletions
diff --git a/agent/filesystem_btrfs.go b/agent/filesystem_btrfs.go
Lines changed: 141 additions & 0 deletions
diff --git a/agent/filesystem_btrfs_test.go b/agent/filesystem_btrfs_test.go
Lines changed: 174 additions & 0 deletions
diff --git a/agent/filesystem_unix.go b/agent/filesystem_unix.go
Lines changed: 29 additions & 0 deletions
@@ -2,6 +2,29 @@
 
 All notable changes to SMART Sniffer are documented here.
 
+## v0.5.6 -- 2026-05-02
+
+Agent + integration release. Both components updated.
+
+### Fixed
+- **Wear level sensor now reports consistent "percentage used" across ATA and NVMe drives** -- ATA SSDs report a normalized SMART value where 100 means "new" and 0 means "worn." NVMe drives report `percentage_used` where 0 means "new" and 100 means "worn" -- the opposite scale. Previously, the integration passed both values through as-is, so the same sensor meant opposite things depending on drive protocol. Now ATA values are inverted to match NVMe: 0% = new drive, 100% = fully worn. **Breaking change:** if you have automations based on the ATA wear sensor, your values will invert (e.g., a new Samsung 870 EVO previously showed 99, now shows 1).
+- **Installer filesystem picker: bind mount deduplication** -- the picker now parses `/proc/self/mountinfo` (when available) and deduplicates entries by `(source, fstype, root)`. Previously, bind mounts of the same filesystem appeared as separate entries, tripling the list on systems like ZimaOS. Falls back to `/proc/mounts` and then `mount` on systems where mountinfo is not available.
+- **Installer filesystem picker: path unescaping** -- mount paths containing spaces, tabs, newlines, or backslashes are now displayed correctly. The kernel writes these characters as octal escapes in `/proc/mounts` and mountinfo output (`\040` for space, `\011` for tab, `\012` for newline, `\134` for backslash), and the installer previously showed the raw escaped strings.
+- **Installer summary now shows the correct IP** -- the post-install summary previously showed the first IP from `hostname -I`, which on systems with Docker bridges or virtual interfaces was often an unreachable internal IP (e.g., `172.18.0.1`). It now shows the IP of the mDNS-advertised interface you selected during setup.
+
+### Added
+- **ATA SSD wear now triggers attention warnings** -- ATA SSDs with wear level at 90% or higher (after inversion to "percentage used") now fire a WARNING in the Attention Needed sensor, matching the existing NVMe threshold. Previously only NVMe drives got wear-based attention warnings.
+- **btrfs filesystem fallback** -- when `statvfs` returns zero for a btrfs mount (a known quirk on some multi-device or DUP-profile configurations), the agent falls back to `btrfs filesystem usage --raw` for accurate size/usage data. Requires `btrfs-progs` to be installed (most btrfs systems have it). Without it, an affected mount reports as `(unknown size)` in the picker and zero-byte usage from the API.
+- **Installer: bind mount hiding** -- bind mounts of subdirectories are hidden by default behind a `[+N bind mounts hidden]` tag with a y/N prompt to reveal them. Reduces noise on systems with many bind mounts.
+
+### Changed
+- **Sensor name:** "Wear Leveling / Percentage Used" renamed to "Wear Level (% Used)" for clarity.
+
+### Upgrade Notes
+- **Both agent and integration should be updated.** Replace the agent binary or re-run the installer. Update the integration via HACS or manually.
+- **Wear sensor breaking change:** ATA SSD wear values are inverted. If you have automations checking wear level, review your thresholds. The sensor now consistently means "percentage of rated life consumed" for both ATA and NVMe. A new drive reads ~0-1%, a heavily worn drive reads 90%+.
+- **btrfs users:** install `btrfs-progs` if not already present for accurate filesystem reporting. The agent works without it, but any btrfs mount where `statvfs` returns zero will report zero usage.
+
 ## v0.5.5.5 -- 2026-04-27
 
 Installer-only patch. No agent, integration, or config changes.
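
Reviewer sketch for the path unescaping entry in the v0.5.6 changelog: the kernel writes special characters in mount paths as three-digit octal escapes, so decoding them is a small byte-level loop. The helper name `unescapeMountPath` is hypothetical and Go is used only for illustration; the installer's actual implementation is not shown in this diff.

```go
package main

import (
	"fmt"
	"strings"
)

// isOctal reports whether c is an ASCII octal digit.
func isOctal(c byte) bool { return c >= '0' && c <= '7' }

// unescapeMountPath decodes backslash + three octal digits (for example
// \040 for space) as found in /proc/mounts and mountinfo fields.
// Hypothetical helper for illustration, not the installer's code.
func unescapeMountPath(s string) string {
	var b strings.Builder
	for i := 0; i < len(s); i++ {
		if s[i] == '\\' && i+3 < len(s) &&
			isOctal(s[i+1]) && isOctal(s[i+2]) && isOctal(s[i+3]) {
			v := int(s[i+1]-'0')<<6 | int(s[i+2]-'0')<<3 | int(s[i+3]-'0')
			if v < 256 {
				b.WriteByte(byte(v))
				i += 3 // skip the three digits just consumed
				continue
			}
		}
		// Not a complete escape: keep the byte unchanged.
		b.WriteByte(s[i])
	}
	return b.String()
}

func main() {
	fmt.Printf("%q\n", unescapeMountPath(`/mnt/My\040Disk`)) // "/mnt/My Disk"
}
```

Incomplete or out-of-range sequences are passed through unchanged, so a path that merely contains a backslash is not corrupted.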
 
@@ -224,6 +224,8 @@ Binaries output to `agent/build/`.
 
 **Requires:** `smartmontools` **7.0+** on each monitored machine (for JSON output support). The installer handles installation automatically (Homebrew on macOS, apt/dnf/yum on Linux), but some older distros ship smartctl 6.x which does not support the `--json` flag the agent relies on. Run `smartctl --version` to check. If you're on 6.x, install a newer version from the [smartmontools releases page](https://www.smartmontools.org/wiki/Download) or from a backports repository.
 
+**Optional:** `btrfs-progs` is recommended on systems with btrfs filesystems. The installer's disk-usage picker and the agent's `/api/filesystems` endpoint both fall back to `btrfs filesystem usage --raw` when `statvfs` returns zero on a btrfs mount (a known quirk on some multi-device or near-full configurations). Without `btrfs-progs`, affected btrfs entries display as `(unknown size)` in the picker and report zero-byte usage from the API. Most distros install `btrfs-progs` by default when the system is set up with btrfs, so it is usually already present.
+
 ### 1. Install the agent
 
 Run on each machine you want to monitor:
 
@@ -0,0 +1,141 @@
+//go:build !windows
+
+// Phase 1A: btrfs statvfs fallback.
+//
+// Some btrfs configurations (multi-device, certain kernel versions,
+// near-full single-disk) cause syscall.Statfs to return zero values
+// where df-style tools would report real numbers. When that happens,
+// we fall back to parsing `btrfs filesystem usage --raw <path>`.
+//
+// This is a fallback, not the primary source. statvfs is microseconds;
+// a subprocess is milliseconds and forks a child. We only invoke btrfs
+// when statvfs has clearly failed (total==0 on a btrfs mount).
+//
+// Three failure modes, each with a distinct log line so users can
+// diagnose without reading source:
+//   - btrfs-progs not installed
+//   - subprocess timed out (5s)
+//   - output didn't parse
+//
+// All three fall through to the original statvfs values (zero); the
+// /api/filesystems endpoint reports zeros and continues working.
+package main
+
+import (
+	"bytes"
+	"context"
+	"errors"
+	"fmt"
+	"os/exec"
+	"regexp"
+	"strconv"
+	"time"
+)
+
+// btrfsFallbackTimeout is the maximum wall time we'll allow the
+// `btrfs filesystem usage --raw` subprocess. A hung btrfs binary
+// must not block the agent's poll cycle.
+const btrfsFallbackTimeout = 5 * time.Second
+
+// Sentinel errors for the three documented failure modes. Callers
+// distinguish via errors.Is to emit the right log message.
+var (
+	errBtrfsProgsMissing = errors.New("btrfs-progs not installed")
+	errBtrfsTimeout      = errors.New("btrfs filesystem usage timed out")
+	errBtrfsParse        = errors.New("btrfs filesystem usage parse error")
+)
+
+// btrfsUsage holds the three values we extract from --raw output.
+type btrfsUsage struct {
+	Total     uint64
+	Used      uint64
+	Available uint64
+}
+
+// Anchor patterns for the bare lines in the Overall: block of
+// `btrfs filesystem usage --raw` output. Device size and Used are
+// anchored to the whole line so they cannot match per-block-group lines
+// such as "Data,single: Size:N, Used:N (62.40%)" ("Used:" is mid-line).
+var (
+	reBtrfsDeviceSize = regexp.MustCompile(`(?m)^\s*Device size:\s+(\d+)\s*$`)
+	reBtrfsUsed       = regexp.MustCompile(`(?m)^\s*Used:\s+(\d+)\s*$`)
+	// "Free (estimated):" has an optional trailing "(min: N)" parenthetical,
+	// so this pattern is not anchored at end of line. We only want the
+	// first integer; "min:" is a conservative lower bound we don't expose.
+	reBtrfsFreeEst = regexp.MustCompile(`(?m)^\s*Free \(estimated\):\s+(\d+)`)
+)
+
+// tryBtrfsFallback runs `btrfs filesystem usage --raw <path>` and
+// parses the result. Returns the typed error sentinels documented
+// above so the caller can log the three distinct messages.
+func tryBtrfsFallback(path string) (btrfsUsage, error) {
+	// Cheap pre-check: if the binary isn't even on PATH, fail fast
+	// with the specific sentinel. exec.LookPath is microseconds.
+	if _, err := exec.LookPath("btrfs"); err != nil {
+		return btrfsUsage{}, errBtrfsProgsMissing
+	}
+
+	ctx, cancel := context.WithTimeout(context.Background(), btrfsFallbackTimeout)
+	defer cancel()
+
+	cmd := exec.CommandContext(ctx, "btrfs", "filesystem", "usage", "--raw", path)
+	var stdout, stderr bytes.Buffer
+	cmd.Stdout = &stdout
+	cmd.Stderr = &stderr
+	if err := cmd.Run(); err != nil {
+		// Distinguish timeout from other failures. Check ctx.Err(), not err:
+		// a deadline-killed child typically surfaces as *exec.ExitError.
+		if ctx.Err() == context.DeadlineExceeded {
+			return btrfsUsage{}, errBtrfsTimeout
+		}
+		// Any other run error (non-zero exit, permission denied,
+		// disappeared mountpoint) is treated as a parse-class failure
+		// from the caller's perspective. Wrap so the caller sees the
+		// underlying cause and btrfs's stderr if they choose to inspect it.
+		return btrfsUsage{}, fmt.Errorf("%w: %v (stderr: %q)", errBtrfsParse, err, bytes.TrimSpace(stderr.Bytes()))
+	}
+
+	return parseBtrfsUsageRaw(stdout.Bytes())
+}
+
+// parseBtrfsUsageRaw extracts Device size, Used, and Free (estimated)
+// from the --raw output. Kept as a separate package-level function so
+// unit tests can exercise parsing without a real btrfs binary.
+func parseBtrfsUsageRaw(out []byte) (btrfsUsage, error) {
+	totalMatch := reBtrfsDeviceSize.FindSubmatch(out)
+	usedMatch := reBtrfsUsed.FindSubmatch(out)
+	freeMatch := reBtrfsFreeEst.FindSubmatch(out)
+
+	if totalMatch == nil || usedMatch == nil {
+		return btrfsUsage{}, fmt.Errorf("%w: missing Device size or Used line", errBtrfsParse)
+	}
+
+	total, err := strconv.ParseUint(string(totalMatch[1]), 10, 64)
+	if err != nil {
+		return btrfsUsage{}, fmt.Errorf("%w: Device size not numeric: %v", errBtrfsParse, err)
+	}
+	used, err := strconv.ParseUint(string(usedMatch[1]), 10, 64)
+	if err != nil {
+		return btrfsUsage{}, fmt.Errorf("%w: Used not numeric: %v", errBtrfsParse, err)
+	}
+
+	// Available is best-effort. If "Free (estimated):" is missing or
+	// non-numeric, derive it from total-used; a reported 0 is kept as-is.
+	// The endpoint contract requires a value, and an off-by-some is
+	// acceptable given btrfs's own Free estimation is itself an estimate.
+	available, haveFree := uint64(0), false
+	if freeMatch != nil {
+		if v, perr := strconv.ParseUint(string(freeMatch[1]), 10, 64); perr == nil {
+			available, haveFree = v, true
+		}
+	}
+	if !haveFree && total > used {
+		available = total - used
+	}
+
+	return btrfsUsage{
+		Total:     total,
+		Used:      used,
+		Available: available,
+	}, nil
+}
@@ -0,0 +1,174 @@
+//go:build !windows
+
+package main
+
+import (
+	"errors"
+	"os"
+	"strings"
+	"testing"
+)
+
+// Real `btrfs filesystem usage --raw` output captured from David's
+// ZimaOS box (Brookdale NAS, /dev/md0). Kept inline rather than read
+// from disk so the test is hermetic. Mirrors
+// docs/internal/research/test-fixtures/zimaos-btrfs-usage.txt.
+const fixtureBtrfsUsageRaw = `Overall:
+    Device size:                2000263643136
+    Device allocated:           1968050339840
+    Device unallocated:           32213303296
+    Device missing:                        0
+    Device slack:                          0
+    Used:                       1225996673024
+    Free (estimated):            769020321792      (min: 752913670144)
+    Free (statfs, df):           769019273216
+    Data ratio:                         1.00
+    Metadata ratio:                     2.00
+    Global reserve:                536870912      (used: 0)
+    Multiple profiles:                    no
+
+Data,single: Size:1959599865856, Used:1222792847360 (62.40%)
+   /dev/md0      1959599865856
+
+Metadata,DUP: Size:4216848384, Used:1601634304 (37.98%)
+   /dev/md0         8433696768
+
+System,DUP: Size:8388608, Used:278528 (3.32%)
+   /dev/md0           16777216
+
+Unallocated:
+   /dev/md0         32213303296
+`
+
+func TestParseBtrfsUsageRaw_RealFixture(t *testing.T) {
+	usage, err := parseBtrfsUsageRaw([]byte(fixtureBtrfsUsageRaw))
+	if err != nil {
+		t.Fatalf("expected success, got: %v", err)
+	}
+
+	const (
+		wantTotal     = uint64(2000263643136)
+		wantUsed      = uint64(1225996673024)
+		wantAvailable = uint64(769020321792)
+	)
+	if usage.Total != wantTotal {
+		t.Errorf("Total = %d, want %d", usage.Total, wantTotal)
+	}
+	if usage.Used != wantUsed {
+		t.Errorf("Used = %d, want %d", usage.Used, wantUsed)
+	}
+	if usage.Available != wantAvailable {
+		t.Errorf("Available = %d, want %d", usage.Available, wantAvailable)
+	}
+}
+
+// Regression: the per-block-group lines have "Used:" mid-line (e.g.
+// "Data,single: Size:N, Used:N (62.40%)"). The Overall: parser must
+// only match the bare-line Used:, not these.
+func TestParseBtrfsUsageRaw_InlineUsedRegression(t *testing.T) {
+	// Strip the Overall: block to verify the parser does NOT pick up
+	// the inline Used field as a substitute.
+	overallEnd := strings.Index(fixtureBtrfsUsageRaw, "\nData,single:")
+	if overallEnd < 0 {
+		t.Fatal("test fixture malformed: missing Data,single section marker")
+	}
+	withoutOverall := fixtureBtrfsUsageRaw[overallEnd:]
+
+	_, err := parseBtrfsUsageRaw([]byte(withoutOverall))
+	if err == nil {
+		t.Fatal("expected parse error when Overall: block is missing, got nil")
+	}
+	if !errors.Is(err, errBtrfsParse) {
+		t.Errorf("expected errBtrfsParse, got %v", err)
+	}
+}
+
+func TestParseBtrfsUsageRaw_MissingDeviceSize(t *testing.T) {
+	input := `Overall:
+    Used:                       1225996673024
+`
+	_, err := parseBtrfsUsageRaw([]byte(input))
+	if !errors.Is(err, errBtrfsParse) {
+		t.Errorf("expected errBtrfsParse, got %v", err)
+	}
+}
+
+func TestParseBtrfsUsageRaw_MissingUsed(t *testing.T) {
+	input := `Overall:
+    Device size:                2000263643136
+`
+	_, err := parseBtrfsUsageRaw([]byte(input))
+	if !errors.Is(err, errBtrfsParse) {
+		t.Errorf("expected errBtrfsParse, got %v", err)
+	}
+}
+
+func TestParseBtrfsUsageRaw_EmptyInput(t *testing.T) {
+	_, err := parseBtrfsUsageRaw([]byte(""))
+	if !errors.Is(err, errBtrfsParse) {
+		t.Errorf("expected errBtrfsParse, got %v", err)
+	}
+}
+
+func TestParseBtrfsUsageRaw_NonNumericTotal(t *testing.T) {
+	input := `Overall:
+    Device size:                NOTANUMBER
+    Used:                       1225996673024
+`
+	// Regex requires \d+, so a non-numeric value never matches and the
+	// parser reports a missing Device size line, still as errBtrfsParse.
+	_, err := parseBtrfsUsageRaw([]byte(input))
+	if !errors.Is(err, errBtrfsParse) {
+		t.Errorf("expected errBtrfsParse, got %v", err)
+	}
+}
+
+// Available falls back to Total - Used when Free (estimated) is missing.
+func TestParseBtrfsUsageRaw_AvailableFallback(t *testing.T) {
+	input := `Overall:
+    Device size:                1000
+    Used:                       300
+`
+	usage, err := parseBtrfsUsageRaw([]byte(input))
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	if usage.Total != 1000 {
+		t.Errorf("Total = %d, want 1000", usage.Total)
+	}
+	if usage.Used != 300 {
+		t.Errorf("Used = %d, want 300", usage.Used)
+	}
+	if usage.Available != 700 {
+		t.Errorf("Available = %d, want 700 (Total - Used fallback)", usage.Available)
+	}
+}
+
+// errBtrfsProgsMissing is returned when btrfs is not on PATH. We
+// simulate this by setting PATH to the empty string, so exec.LookPath
+// cannot find any binary. Skip if the test environment doesn't allow
+// PATH manipulation (very rare but possible).
+func TestTryBtrfsFallback_BinaryMissing(t *testing.T) {
+	origPath := os.Getenv("PATH")
+	t.Cleanup(func() { os.Setenv("PATH", origPath) })
+
+	// Empty PATH guarantees exec.LookPath fails for "btrfs". We don't
+	// need /tmp to be free of a btrfs binary -- empty PATH is enough.
+	if err := os.Setenv("PATH", ""); err != nil {
+		t.Skipf("cannot set PATH for test: %v", err)
+	}
+
+	_, err := tryBtrfsFallback("/")
+	if !errors.Is(err, errBtrfsProgsMissing) {
+		t.Errorf("expected errBtrfsProgsMissing, got %v", err)
+	}
+}
+
+// Note on timeout testing: the timeout path requires a btrfs binary
+// that hangs longer than 5s. Constructing this hermetically would
+// require a test double that injects a fake runner via a package-level
+// hook. The current implementation uses exec.LookPath + exec.CommandContext
+// directly for clarity; if timeout flakiness is reported in production
+// we can refactor to inject a runner. For now the timeout sentinel is
+// covered by code review of the ctx.Err() == context.DeadlineExceeded
+// branch in tryBtrfsFallback.
@@ -3,6 +3,7 @@
 package main
 
 import (
+	"errors"
 	"log"
 	"syscall"
 )
@@ -37,6 +38,34 @@ func (fc *FilesystemCache) Refresh() {
 		info.UsedBytes = info.TotalBytes - freeBytes
 		info.AvailableBytes = stat.Bavail * uint64(stat.Bsize)
 
+		// Phase 1A: btrfs statvfs fallback.
+		//
+		// We trigger fallback only when TotalBytes == 0 on a btrfs mount.
+		// We do NOT broaden the trigger to "implausible non-zero" cases
+		// (e.g. btrfs single-disk near-full overstating free). That would
+		// fork a subprocess on every poll cycle for every btrfs mount,
+		// which is wasteful. The CTO's panel point that btrfs CLI is the
+		// more reliable source still stands -- this is a deliberate
+		// performance/reliability tradeoff. See
+		// plan-btrfs-filesystem-reporting.md for the full reasoning.
+		if info.TotalBytes == 0 && cfg.FSType == "btrfs" {
+			usage, err := tryBtrfsFallback(cfg.Path)
+			switch {
+			case err == nil:
+				info.TotalBytes = usage.Total
+				info.UsedBytes = usage.Used
+				info.AvailableBytes = usage.Available
+				log.Printf("filesystem: using btrfs-progs for %s (statvfs returned zero)", cfg.Path)
+			case errors.Is(err, errBtrfsProgsMissing):
+				log.Printf("filesystem: btrfs-progs not installed, returning statvfs zeros for %s", cfg.Path)
+			case errors.Is(err, errBtrfsTimeout):
+				log.Printf("filesystem: btrfs filesystem usage timed out after %s for %s", btrfsFallbackTimeout, cfg.Path)
+			default:
+				// Wraps errBtrfsParse or an exec error treated as parse-class.
+				log.Printf("filesystem: btrfs fallback failed for %s: %v", cfg.Path, err)
+			}
+		}
+
 		if info.TotalBytes > 0 {
 			info.UsePercent = float64(info.UsedBytes) / float64(info.TotalBytes) * 100.0
 			// Round to one decimal place.