[TTAHUB-5243] Add security findings register ADR and operating specification by kryswisnaskas · Pull Request #3710 · HHS/Head-Start-TTADP

kryswisnaskas · 2026-06-17T19:15:57Z

Description of change

Adds ADR 0027 proposing to maintain a repository owned security findings register under security/, and security/README.md defining the full operating specification. The spec covers the control inventory (Semgrep, Yarn Audit, ZAP, ClamAV), the common finding schema, disposition model, approval authority requirements, CI enforcement policy, and SCA specific pending observations escalation rules.
.

How to test

Review docs/adr/0027-security-findings-register.md and security/README.md and verify they meet the TTAHUB-5243 acceptance criteria.

Jira Issue(s)

https://jira.acf.gov/browse/TTAHUB-5243

Checklists

Every PR

Linked Jira issue
JIRA issue status updated
[n/a] Code is meaningfully tested — documentation only, no executable code
[n/a] Meets accessibility standards (WCAG 2.1 Levels A, AA) — no UI changes
[n/a] API Documentation updated — no API changes
[n/a] Boundary diagram updated — no new external integrations
[n/a] Logical Data Model updated — no schema changes
Architectural Decision Records written for major infrastructure decisions — ADR 0027 added
[n/a] UI review complete — no UI changes
QA review complete

Before merge to main

OHS demo complete
Ready to create production PR

Production Deploy

PR created as Draft
Staging smoke test completed
PR transitioned to Open (this ready_for_review transition triggers the Slack/Jira automation)
Reviewer added after the PR is Open (elainaparrish is the authorized approver under normal circumstances)
- Sequence: Draft PR → Smoke test → Open PR (automation runs) → Add reviewer
- Confirm that the Slack notification was sent after the PR was opened
- Confirm that linked Jira ticket(s) transitioned as expected; if not, review the GitHub Actions workflow logs

After merge/deploy

Update JIRA ticket status

Bumps [fast-xml-builder](https://github.com/NaturalIntelligence/fast-xml-builder) from 1.1.5 to 1.2.0. - [Changelog](https://github.com/NaturalIntelligence/fast-xml-builder/blob/main/CHANGELOG.md) - [Commits](NaturalIntelligence/fast-xml-builder@v1.1.5...v1.2.0) --- updated-dependencies: - dependency-name: fast-xml-builder dependency-version: 1.2.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>

Copilot

Pull request overview

This PR adds design documentation for a repository-owned Security Findings Register under security/, establishing governance and an operating specification ahead of implementing tooling and CI enforcement.

Changes:

Added ADR 0027 documenting the decision to maintain a repository-owned security findings register and its governance model.
Added security/README.md specifying the proposed register schema, identity rules, approval requirements, and CI enforcement policies (including SCA-specific escalation behavior).

Impact assessment: Benefits medium (clear, centralized audit evidence model and enforcement rules); risks low (documentation-only), with one clarification needed to avoid ambiguous CI behavior.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
security/README.md	Operating specification for the proposed security findings register (schema, identity, approval, and CI enforcement policy).
docs/adr/0027-security-findings-register.md	ADR establishing the decision and summarizing the register model and enforcement approach.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

thewatermethod

Consider logging tickets or addressing the following (cherry-picked from UI review):

`git show` arg is interpolated from user input without an allow-list

File: tools/security-findings.js:1112-1117

contents = execFileSync('git', ['show', `${pendingRef}:${gitPath}`], {
  cwd,
  encoding: 'utf8',
  stdio: ['ignore', 'pipe', 'pipe'],
});

Problem: pendingRef originates from the --previous-pending-ref CLI flag (tools/security-findings.js:1499, 1605, 1628). Because execFileSync does not invoke a shell, there is no shell-injection risk. However, the value is concatenated directly into a git refspec without validation. A malicious or careless caller can pass things like HEAD -- some/other/file or rely on Git's flexible revision parser to read arbitrary tracked files (e.g., HEAD:.env.production if that file ever exists). In the current CI use this is harmless because the value is a hard-coded literal HEAD~1 in package.json, but the tool is intended to be re-used and the input is unvalidated.

Impact: Low — local-only tool, no shell expansion, no current untrusted input path. Surface to harden because the project does in fact run secrets in CI runners.

Suggested Fix: Validate the ref format and reject anything containing whitespace, :, or .. patterns that would re-target the path component:

const GIT_REF_PATTERN = /^[A-Za-z0-9_./~^@-]+$/;

function loadPendingObservationsStoreFromGitRef({ pendingPath, pendingRef, cwd = process.cwd() } = {}) {
  if (!GIT_REF_PATTERN.test(pendingRef)) {
    throw new Error(`Invalid --previous-pending-ref ${JSON.stringify(pendingRef)}`);
  }
  const gitPath = toGitPath(pendingPath);
  // …
}

Additional Context: Project convention from AGENTS.md "Traps to Avoid → SQL injection in filters" calls for "independently validate expected types … before using them in queries"; the same hygiene applies to ref values passed to git show.

`arraysMatch` is order-sensitive and serializes whole arrays

File: tools/security-findings.js:685-687

function arraysMatch(left = [], right = []) {
  return JSON.stringify(left) === JSON.stringify(right);
}

Problem: Used by validateSastBaselineEvidence to detect drift between security/sast/baseline.json and security/sast/scan-config.json for fields like configs, include, exclude, blockingSeverities. If the two arrays contain the same logical set of values in a different order — for example because Semgrep reordered config sources, or someone hand-edits one file — the validator will warn "differs from scan config" even though nothing meaningful changed. Conversely, deeply nested objects-in-arrays that match shallowly but have undefined-vs-missing keys can mismatch under JSON.stringify.

Impact: Low — drives warnings, not errors. But noisy warnings train people to ignore them, undermining the value of the drift check.

Suggested Fix: For these specific config fields, compare as sorted sets where order is not semantically meaningful, and keep deep equality only for fields where order matters:

function arraysMatchUnordered(left = [], right = []) {
  if (left.length !== right.length) return false;
  const sortedLeft = [...left].map(String).sort();
  const sortedRight = [...right].map(String).sort();
  return sortedLeft.every((value, idx) => value === sortedRight[idx]);
}

const driftChecks = [
  ['version', baseline.scanner.version, scanConfig.semgrepVersion, 'scalar'],
  ['configs', baseline.scanner.configs, scanConfig.configs, 'unordered'],
  ['include', baseline.scanner.include, scanConfig.include, 'unordered'],
  ['exclude', baseline.scanner.exclude, scanConfig.exclude, 'unordered'],
  // …
];

Additional Context: If order is in fact semantically significant for configs (Semgrep applies them in order), keep arraysMatch for that one field and use unordered comparison for the others.

`tools/security-findings.js` is becoming a god-file

File: tools/security-findings.js (1674 lines)

Problem: A single module now contains: baseline construction, register seeding, register validation, deadline enforcement, pending-observation store management, git-ref I/O, severity mapping, business-day math, ISO date validation, CLI parsing, and pretty-printing. Most functions are small and well-named, but the file is hard to scan and the module.exports selectively re-exports 16 helpers, making it unclear what is "public API" vs internal.

Impact: Future contributors will struggle to find logic; PR diffs touching unrelated concerns will sit on top of each other; testing requires importing from a large surface.

Suggested Fix: Split into a directory of focused modules under tools/security-findings/:

tools/security-findings/
  index.js                  // CLI entrypoint, parseCliArguments, main, printHelp
  paths.js                  // DEFAULT_* constants, resolveProjectPath, toGitPath
  dates.js                  // operationalDate, parseOperationalDate, businessDaysSince, calendarDaysBetween, validateIsoDate
  scan-types.js             // loadScanTypes, validateScanTypes, mapSeverity
  ids.js                    // buildSastFindingId, buildScaFindingId, sanitizeIdComponent
  sca-baselines.js          // collectAuditAdvisoriesFromFile, createScaBaseline(s)
  register.js               // build*RegisterEntries, createRegister, collectMigrationGaps, validateRegister
  pending-observations.js   // load*, validatePendingObservations, updatePendingObservations, buildPendingObservationEntry, slaThresholdForSeverity

This also lets you write per-module tests, which are easier to target than the current single 1.4k-line test file. Not blocking; a follow-up ticket is fine.

Default `ticket = 'TTAHUB-5243'` in `createRegister` will mis-attribute future re-seeds

File: tools/security-findings.js:551 (and 639)

function buildScaRegisterEntries({
  // …
  owner = 'TTA Hub AppDev',
  ticket = 'TTAHUB-5243',
  // …

Problem: TTAHUB-5243 is the implementation ticket for the register itself. Using it as the default owner ticket for SCA entries is appropriate for the one-time initial migration seed. However, if someone runs yarn security:register:seed again later (e.g., after a fresh build-sca-baselines), all newly-seeded entries will be tagged with the implementation ticket — masking the real outstanding work and burning audit trust.

Impact: Low — the JIRA ticket can be corrected by hand, and validation does not enforce ticket uniqueness. But the placeholder is sticky and easy to forget.

Suggested Fix: Either (a) require --ticket to be passed explicitly for seed-register and fail when it is not, or (b) leave ticket null by default so the resulting migration.gaps include missing-ticket, which forces explicit assignment. Option (b) is minimally invasive:

function buildScaRegisterEntries({
  // …
  owner = 'TTA Hub AppDev',
  ticket = null,
  closureTarget = null,
  justification = 'Migrated from the legacy Yarn Audit active exception set. ' +
    'Technical assessment and approval evidence still need to be added.',
  // …

Then package.json (or CI) can pass --ticket TTAHUB-5243 only for the genuine initial migration.

Nested ternary in `enforceDispositionDeadlines` is harder to read than necessary

File: tools/security-findings.js:850-855

const dueField =
  entry.disposition === 'deferred'
    ? 'closureTarget'
    : entry.disposition === 'accepted'
      ? 'reviewBy'
      : null;

Problem: Minor readability. The conditional could be expressed as a lookup, which also makes it trivial to extend if new dispositions are added later.

Suggested Fix:

const DUE_FIELD_BY_DISPOSITION = {
  deferred: 'closureTarget',
  accepted: 'reviewBy',
};

// …
const dueField = DUE_FIELD_BY_DISPOSITION[entry.disposition] || null;

`parseArgs` will throw on unknown options with an opaque error

File: tools/security-findings.js:1483-1513

Problem: node:util parseArgs throws on unknown flags. The CLI catches errors at line 1647 and prints error.message, but the message is unhelpful (e.g., Unknown option '--prevous-pending-ref') and there is no --help flag wired in — help is only available as a positional. A user with a typo gets a wall of stack with no remediation.

Impact: UX of the new CLI.

Suggested Fix: Add help: { type: 'boolean', short: 'h' } to the options, and in main short-circuit to printHelp() when values.help is set. Also wrap the parseArgs call so unknown-option errors print printHelp() after the error message:

function parseCliArguments(args = process.argv.slice(2)) {
  try {
    return parseArgsImpl(args);
  } catch (error) {
    console.error(error.message);
    printHelp();
    process.exit(2);
  }
}

Validation messages embed the same magic strings the tests pattern-match on

File: tools/security-findings.test.js:386-391, 505-509, 722, 865-866, etc.

Problem: Tests assert against substrings of human-readable error strings ("missing approval evidence", "due in 8 calendar days", "missing closure target"). Reasonable for now, but it couples message wording to test correctness. If a future tweak rewords a warning, tests will break for cosmetic reasons.

Impact: Low — tests are easy to update; nobody else parses these strings.

Suggested Fix (optional, for a future refactor): Have collectMigrationGaps return structured codes (e.g., MISSING_APPROVAL_EVIDENCE, MISSING_CLOSURE_TARGET) and translate to messages at the print boundary. Tests then assert on codes. Same idea for enforceDispositionDeadlines warning/error categories.

`paths` in SCA baseline can leak internal dependency graph details

File: tools/security-findings.js:213-222 and SCA baseline JSON output

Problem: SCA baselines record paths like email-templates>@ladjs/i18n>qs (see security/dependencies/backend-baseline.json and tests at line 268). These reveal the project's transitive dependency graph in version control. The information is already implicit in yarn.lock, so this is not a leak per se, but the data is also low-value for the register — it doesn't appear in the schema table in security/README.md:100-118 and is not used by the validator.

Impact: Cosmetic; modest churn in baseline diffs whenever transitive paths change without the advisory itself changing.

Suggested Fix (optional): Either document paths as a deliberate source-field for triage in the schema table, or drop it from the baseline and rely on yarn.lock for path provenance. If kept, consider keeping only the leaf module rather than the full ancestry chain.

AdamAdHocTeam

Three things to double check via AI:

Tooling and tests are not wired into CI (severity: high)
Backend yarn test runs jest build/server/src --runInBand (package.json:74). The new test file at security-findings.test.js lives outside src and is not in the TypeScript build output, so its 1,384 lines never execute in yarn test or yarn test:ci. None of security:validate, security:validate:live, security:validate:strict, or security:sca:pending are referenced in config.yml — only the pre-existing sast_scan job. Net effect: the central guarantee in the README ("security:validate fails when a current finding is absent from the register…") is not enforced for any pull request. The register can drift, the tool can regress, and CI will not catch either. Wire the validator and its test file into CircleCI before merging, or this PR delivers documentation without enforcement.
firstSeen cannot be trusted without a working --previous-pending-ref (severity: high)
In security-findings.js:1281-1298, validatePendingObservations chooses the "trusted prior" entry as:

When no prior store is supplied, the comparison uses actual itself, so expected.firstSeen === actual.firstSeen is tautological — a contributor can edit pending-observations.json to reset firstSeen and the validator will pass. Combined with issue #3 below, this defeats the SLA timer the ADR introduces. security:validate:live is the only path that supplies --previous-pending-ref HEAD~1, and even that is not invoked anywhere in CI today (see issue #1). The README documents this as "bootstrap" behavior, but the practical consequence is that firstSeen is mutable by anyone with commit access. At minimum, scheduled CI must run security:validate:live, and the validator should refuse to "bootstrap" when running in CI (e.g., require a non-null trusted store unless an explicit --bootstrap flag is set).

loadPendingObservationsStoreFromGitRef swallows real errors as "no prior snapshot" (severity: medium-high)
In security-findings.js:1115-1133, any git show failure whose stderr contains one of several substrings ('exists on disk, but not in', 'invalid object name', 'unknown revision or path not in the working tree', 'bad revision', or the very loose pair 'path' + 'does not exist') is treated as "the file did not exist in that ref" and returns null. This silently falls back to the untrusted bootstrap path described in issue #2.

The error-string detection is brittle in two ways:

It depends on English git messages and the exact phrasing of the installed git version. Localized environments (e.g., LC_ALL=de_DE) will not match and will throw — or worse, will match by accident.
More importantly, CircleCI's default checkout produces a shallow clone. HEAD~~1 is frequently unavailable; git show HEAD~~1:... then fails with messages like fatal: bad revision 'HEAD~1', which matches 'bad revision' and gets silently downgraded. The scheduled SCA workflow would then accept whatever firstSeen is committed today, with no warning — eliminating the SLA integrity guarantee the README promises.
Recommend: differentiate "ref/object missing" from "path missing inside an existing ref" (e.g., git cat-file -e $ref:$path first), fail loudly when the ref itself cannot be resolved, and ensure the scheduled workflow fetches enough depth before running.

kryswisnaskas · 2026-06-22T20:43:15Z

@thewatermethod , thanks. I addressed the findings that affect actual behavior and failure handling: ref validation/bootstrap handling, default ticket guarding, and clearer CLI errors. I did not address the others in this PR because they are either intentional (arraysMatch ordering, dependency paths) or stylistic (nested ternary, test message assertions). I also created follow-up ticket TTAHUB-5488 for the larger file refactor.

github-actions · 2026-06-23T19:53:55Z

⚠️ Diff size advisory: This PR is 7566 lines (7512+, 54−), exceeding the 500-line guideline. Consider splitting into smaller changes.

github-actions · 2026-06-23T19:53:55Z

⚠️ Review count advisory: 1 of 2 required human approvals. 1 more needed. Current approvers: thewatermethod.

dependabot Bot and others added 20 commits May 8, 2026 19:27

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

41c91fc

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

342270f

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

a62c429

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

8ae2bb7

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

7016a1c

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

2f9926e

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

d646a3e

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

e76ab80

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

cf71581

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

6fc4d8d

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

e639e05

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

dc9c625

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

d21bac5

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

d8e6c6b

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

deba56d

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

7707931

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

6a7b1ef

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

bb63b86

Add security register

b343a71

kryswisnaskas requested a review from Copilot June 17, 2026 19:21

Copilot started reviewing on behalf of kryswisnaskas June 17, 2026 19:21 View session

Copilot AI reviewed Jun 17, 2026

View reviewed changes

Comment thread security/README.md

kryswisnaskas and others added 5 commits June 17, 2026 16:28

Potential fix for pull request finding

1c2d724

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Merge README.md changes

94e91f4

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

6971932

Merge branch 'main' into kw/ttahub-5243

f168bb3

Improve writing

c6649e1

kryswisnaskas marked this pull request as ready for review June 18, 2026 17:56

kryswisnaskas requested a review from Andrew565 June 18, 2026 17:56

kryswisnaskas requested review from AdamAdHocTeam, hardwarehuman, thewatermethod and tommaroh June 18, 2026 17:56

github-actions Bot added the review-alerted PR has triggered an overdue review Slack alert label Jun 19, 2026

kryswisnaskas added 2 commits June 22, 2026 10:25

Merge branch 'main' of https://github.com/HHS/Head-Start-TTADP

78e9549

Merge branch 'main' into kw/ttahub-5243

76d13b7

thewatermethod reviewed Jun 22, 2026

View reviewed changes

thewatermethod approved these changes Jun 22, 2026

View reviewed changes

github-actions Bot removed the review-alerted PR has triggered an overdue review Slack alert label Jun 22, 2026

AdamAdHocTeam reviewed Jun 22, 2026

View reviewed changes

Address latest findings

6ad3844

Add temporary placeholders for Jira tickets

d8ae751

kryswisnaskas enabled auto-merge June 23, 2026 18:47

Add actual Jira tickets

5df1b08

kryswisnaskas added this pull request to the merge queue Jun 23, 2026

Merged via the queue into main with commit 57e9fad Jun 23, 2026
14 checks passed

kryswisnaskas deleted the kw/ttahub-5243 branch June 23, 2026 20:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TTAHUB-5243] Add security findings register ADR and operating specification#3710

[TTAHUB-5243] Add security findings register ADR and operating specification#3710
kryswisnaskas merged 30 commits into
mainfrom
kw/ttahub-5243

kryswisnaskas commented Jun 17, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

thewatermethod left a comment

Uh oh!

AdamAdHocTeam left a comment

Uh oh!

kryswisnaskas commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 23, 2026

Uh oh!

github-actions Bot commented Jun 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

kryswisnaskas commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of change

How to test

Jira Issue(s)

Checklists

Every PR

Before merge to main

Production Deploy

After merge/deploy

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

thewatermethod left a comment

Choose a reason for hiding this comment

git show arg is interpolated from user input without an allow-list

arraysMatch is order-sensitive and serializes whole arrays

tools/security-findings.js is becoming a god-file

Default ticket = 'TTAHUB-5243' in createRegister will mis-attribute future re-seeds

Nested ternary in enforceDispositionDeadlines is harder to read than necessary

parseArgs will throw on unknown options with an opaque error

Validation messages embed the same magic strings the tests pattern-match on

paths in SCA baseline can leak internal dependency graph details

Uh oh!

AdamAdHocTeam left a comment

Choose a reason for hiding this comment

Uh oh!

kryswisnaskas commented Jun 22, 2026

Uh oh!

github-actions Bot commented Jun 23, 2026

Uh oh!

github-actions Bot commented Jun 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kryswisnaskas commented Jun 17, 2026 •

edited

Loading

`git show` arg is interpolated from user input without an allow-list

`arraysMatch` is order-sensitive and serializes whole arrays

`tools/security-findings.js` is becoming a god-file

Default `ticket = 'TTAHUB-5243'` in `createRegister` will mis-attribute future re-seeds

Nested ternary in `enforceDispositionDeadlines` is harder to read than necessary

`parseArgs` will throw on unknown options with an opaque error

`paths` in SCA baseline can leak internal dependency graph details