Security Public-Readiness Audit

Audit date: 2026-02-24
Branch: audit/security-public-readiness

Scope

Repository content and tracked artifacts
Secret-leak risk in current files
GitHub repo security posture (settings/rules/scanning)
CI/workflow hardening baseline

What Was Checked

Secret checks: scripts/ci/secret-detection.sh (passed)
Targeted scans for credential patterns and personal test strings
Workflow review: .github/workflows/*.yml
Repo settings via GitHub API:
- security_and_analysis
- branch protection
- rulesets
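
As a rough illustration of the settings check, the sketch below reads the security_and_analysis object from a repository API response and lists every feature that is not reported as enabled. The function name and the sample payload are illustrative assumptions, not output captured during this audit.

```python
from typing import Any


def disabled_features(repo: dict[str, Any]) -> list[str]:
    """Return security_and_analysis features whose status is not "enabled"."""
    analysis = repo.get("security_and_analysis") or {}
    return sorted(
        name
        for name, setting in analysis.items()
        if not isinstance(setting, dict) or setting.get("status") != "enabled"
    )


# Illustrative payload shaped like a GET /repos/{owner}/{repo} response.
sample_repo = {
    "security_and_analysis": {
        "secret_scanning": {"status": "disabled"},
        "secret_scanning_push_protection": {"status": "disabled"},
        "dependabot_security_updates": {"status": "enabled"},
    }
}

print(disabled_features(sample_repo))
```

The object can be absent from the response (for example when the caller lacks sufficient permissions), so an empty result for such a payload should be read as "unknown", not as "all enabled".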

Findings

Critical Before Public

GitHub security analysis features are disabled

Evidence: the repo's security_and_analysis settings report code security, Dependabot security updates, secret scanning, and push protection as disabled.
Risk: no server-side leak or vulnerability detection on PRs or pushes.
Required action:
- Enable secret scanning + push protection
- Enable Dependabot security updates
- Enable code scanning / code security

main is not protected and no rulesets are configured

Evidence: branch protection API returns "Branch not protected"; rulesets list is empty.
Risk: accidental direct pushes, no required checks/review gates, weaker change control.
Required action:
- Protect main (require PR + required status checks + disallow force-push)
- Add at least one ruleset for repository-wide branch policy
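
One possible shape for the protection request is sketched below: a body for the REST endpoint PUT /repos/{owner}/{repo}/branches/{branch}/protection that requires pull requests and status checks and disallows force pushes and deletions. The check name "ci", the single required approval, and enforce_admins are assumptions made for illustration; the audit does not specify them.

```python
import json


def main_protection_payload(required_checks: list[str]) -> dict:
    """Build a request body for the branch protection endpoint."""
    return {
        # Require the listed status checks, on an up-to-date branch.
        "required_status_checks": {"strict": True, "contexts": required_checks},
        # Assumption: apply the rules to administrators as well.
        "enforce_admins": True,
        # Assumption: one approving review per pull request.
        "required_pull_request_reviews": {"required_approving_review_count": 1},
        # No per-user or per-team push restrictions.
        "restrictions": None,
        "allow_force_pushes": False,
        "allow_deletions": False,
    }


payload = main_protection_payload(["ci"])  # "ci" is a placeholder check name
print(json.dumps(payload, indent=2))
```

Building the payload as data keeps it reviewable in a PR before anyone with admin rights sends it.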

Benchmarks include high-risk publication content (privacy/leakage surface)

Evidence includes:
- absolute local paths (for example /home/<user>/...) in benchmark manifests
- full prompt/response corpora in checked-in rows.jsonl
- historical test credential strings in benchmark data (<test_user>, <test_pass>)
- runtime metadata/trace identifiers in checked-in artifacts (for example otel_ids.json)
Risk: leaks workstation identity/pathing, sensitive prompt contents, and potential credential material over time.
Required action:
- Define and enforce a benchmark publication policy:
  - public repo keeps only sanitized summaries/aggregates
  - raw rows/log/trace artifacts remain private
- Purge non-compliant benchmark artifacts before public launch
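
To make the path-leak part of this finding concrete, here is a minimal sketch of a redaction pass for absolute home-directory paths. It covers only the /home/<user>/... pattern cited in the evidence (plus the macOS /Users/ equivalent, which is an assumption); it is not the repository's sanitization pipeline and does nothing about prompt corpora or credentials.

```python
import re

# Matches the per-user prefix of Linux (/home/name) and macOS (/Users/name) paths.
_HOME_PREFIX = re.compile(r"/(home|Users)/[^/\s\"']+")


def redact_local_paths(text: str) -> str:
    """Replace the user-specific part of absolute home paths with a placeholder."""
    return _HOME_PREFIX.sub(r"/\1/<user>", text)


def has_local_paths(text: str) -> bool:
    """Report whether text still contains an unredacted home-directory path."""
    return any(
        match.group(0).split("/")[-1] != "<user>"
        for match in _HOME_PREFIX.finditer(text)
    )


# Invented manifest line; "alice" is a placeholder user name.
sample = '{"manifest": "/home/alice/bench/run1/rows.jsonl"}'
redacted = redact_local_paths(sample)
print(redacted)
```

The has_local_paths check is the piece a CI guard would call; redaction alone does not prove that nothing slipped through.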

Possible credential exposure history for historical test credentials

Evidence: strings existed in tracked files and benchmark artifacts; they were used in checks and are present in git history.
Risk: if those credentials were real at any point, they are compromised.
Required action:
- Treat as exposed and rotate/revoke immediately
- Remove all remaining occurrences from tracked files
- Decide whether to rewrite history before going public

Important Hardening (Should Do)

README includes a live Graphistry URL/tokenized sample

Evidence: README.md sample output includes a full live URL with dataset + viztoken.
Action: replace with redacted placeholder URL unless explicitly intended to remain public forever.
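
If redaction is chosen, the replacement can be done mechanically. The sketch below swaps the values of the dataset and viztoken query parameters for a placeholder and leaves the rest of the URL intact. The example URL, its host, and the extra theme parameter are invented for illustration.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Query parameters named in the finding above.
SENSITIVE_PARAMS = {"dataset", "viztoken"}


def redact_url(url: str, placeholder: str = "REDACTED") -> str:
    """Return url with the values of sensitive query parameters replaced."""
    parts = urlsplit(url)
    query = [
        (key, placeholder if key in SENSITIVE_PARAMS else value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
    ]
    return urlunsplit(parts._replace(query=urlencode(query)))


# Invented URL; the host and values are placeholders.
example = "https://viz.example.invalid/graph.html?dataset=abc123&viztoken=s3cr3t&theme=dark"
print(redact_url(example))
```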

GitHub Actions are tag-pinned (@v4) not SHA-pinned

Evidence: actions/checkout@v4, actions/setup-node@v4.
Risk: supply-chain drift from moving tags.
Action: pin actions to full commit SHAs for stricter provenance, including GitHub-maintained ones such as actions/checkout.
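
A check along these lines could flag tag-pinned references in workflow files. It is a sketch that scans workflow text with a regular expression instead of parsing YAML, and the 40-character SHA in the sample is a made-up placeholder, not a real commit.

```python
import re

USES_LINE = re.compile(r"^\s*(?:-\s*)?uses:\s*([^\s#]+)", re.MULTILINE)
FULL_SHA = re.compile(r"^[0-9a-f]{40}$")


def unpinned_actions(workflow_text: str) -> list[str]:
    """Return action references that are not pinned to a full commit SHA."""
    unpinned = []
    for ref in USES_LINE.findall(workflow_text):
        ref = ref.strip("\"'")
        if ref.startswith("./") or ref.startswith("docker://"):
            continue  # local actions and container images are out of scope here
        _, _, version = ref.partition("@")
        if not FULL_SHA.match(version):
            unpinned.append(ref)
    return unpinned


# Invented workflow fragment; the SHA below is a placeholder.
sample_workflow = """
steps:
  - uses: actions/checkout@v4
  - uses: actions/setup-node@0123456789abcdef0123456789abcdef01234567  # v4
"""

print(unpinned_actions(sample_workflow))
```

Keeping the human-readable tag in a trailing comment, as in the second step, preserves readability once the SHA replaces it.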

OTel helper uses insecure gRPC exporter mode by default

Evidence: bin/otel/log_event.py uses OTLPLogExporter(..., insecure=True).
Action: keep local-default behavior only if clearly documented; gate with env toggle for non-local use.
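
One way to express that gate is sketched below: plaintext gRPC stays the default for loopback endpoints, and any other endpoint needs an explicit opt-in. The environment variable name OTEL_HELPER_ALLOW_INSECURE and the function are hypothetical; nothing here is taken from bin/otel/log_event.py beyond the insecure flag itself.

```python
import os
from typing import Mapping, Optional
from urllib.parse import urlsplit

LOCAL_HOSTS = {"localhost", "127.0.0.1", "::1"}
ALLOW_INSECURE_ENV = "OTEL_HELPER_ALLOW_INSECURE"  # hypothetical variable name


def use_insecure_channel(
    endpoint: str, env: Optional[Mapping[str, str]] = None
) -> bool:
    """Decide whether a plaintext (insecure) exporter channel is acceptable."""
    env = os.environ if env is None else env
    # Accept both "host:port" and "scheme://host:port" endpoint forms.
    host = urlsplit(endpoint if "://" in endpoint else f"//{endpoint}").hostname
    if host in LOCAL_HOSTS:
        return True  # local default, as the finding allows
    # Non-local endpoints require an explicit opt-in.
    return env.get(ALLOW_INSECURE_ENV, "").lower() in {"1", "true", "yes"}


print(use_insecure_channel("localhost:4317", {}))
print(use_insecure_channel("collector.example.invalid:4317", {}))
```

The result could be passed as the insecure argument that the helper currently hard-codes to True.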

Low-Risk Improvements

Add CODEOWNERS for security-sensitive paths

Suggested owners for:
- .github/workflows/**
- scripts/ci/**
- .agents/skills/**
- benchmarks/**

Add a pre-public release checklist doc and CI policy check

Validate no benchmark raw rows/logs are added without explicit allowlist.
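
A minimal sketch of such a policy check follows. It takes a list of repository paths (for example from git ls-files) and reports raw benchmark artifacts that are not explicitly allowlisted. The patterns and the empty allowlist are hypothetical policy choices, and this is not the logic of any existing script in the repository.

```python
from fnmatch import fnmatch

# Hypothetical policy: filename patterns treated as raw benchmark artifacts.
RAW_PATTERNS = ["*rows.jsonl", "*.log", "*otel_ids.json"]
# Hypothetical allowlist of explicitly approved raw artifact paths.
ALLOWLIST: frozenset[str] = frozenset()


def policy_violations(
    paths: list[str], allowlist: frozenset[str] = ALLOWLIST
) -> list[str]:
    """Return benchmark paths that look like raw artifacts and are not allowlisted."""
    return [
        path
        for path in paths
        if path.startswith("benchmarks/")
        and path not in allowlist
        and any(fnmatch(path, pattern) for pattern in RAW_PATTERNS)
    ]


# Invented file list for illustration.
tracked = [
    "benchmarks/data/run1/rows.jsonl",
    "benchmarks/data/run1/combined_metrics.json",
    "docs/rows.jsonl",
]
print(policy_violations(tracked))
```

A CI job would exit non-zero when the returned list is non-empty, so a new raw artifact requires a deliberate allowlist change in the same PR.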

Quick Fixes Applied in This Branch

Removed personal test credential strings from active journey specs

Replaced historical test literals with non-personal sentinel literals.

Tightened CI workflow token permissions

Added permissions: contents: read to .github/workflows/ci.yml.

Improved ignore rules for local temporary artifacts

Added .tmp/ and tmp/ to .gitignore.

Go-Public Gate (Recommended)

Do not switch visibility to public until all Critical items above are complete.

Minimum gate:

Credentials rotated/revoked (if ever real) and artifact cleanup complete
Benchmark publication policy enforced; sensitive artifacts removed
GitHub security scanning + push protection enabled
Main branch protection/ruleset enabled

Strike List: Operator-Owned

Rotate/revoke any potentially exposed credentials (including any prior historical test credential usage if real), then confirm they are dead.
Decide benchmark publication policy for public repo: public-safe summaries only vs full raw eval corpora in-repo.
If policy excludes raw corpora: approve history rewrite strategy (rewrite vs preserve and move private).
In GitHub repo settings, enable:
- Secret scanning
- Secret scanning push protection
- Dependabot security updates
- Code security / code scanning
Protect main:
- require pull requests
- require status checks
- disallow force pushes/deletions
Add/approve org ruleset(s) for default branch governance.
Decide whether README keeps a live tokenized sample URL or switches to redacted placeholder.
Decide CODEOWNERS ownership map for security-sensitive paths.

Strike List: Agent-Owned

Implement benchmark sanitization pipeline (public-safe benchmark report mode and redacted aggregate outputs).
Apply publication policy to existing benchmark artifacts and produce a clean public-safe benchmark set.
Add CI guard that fails on non-compliant benchmark artifacts.
Pin third-party GitHub Actions to immutable SHAs.
Add .github/CODEOWNERS once owner mapping is confirmed.
Redact README live sample URL if operator chooses redaction.
Run full secret/leak scans after sanitization and publish a signed-off audit rerun.
Produce final pre-public checklist report with explicit pass/fail gate status.

Completion Update (2026-02-24)

Completed git history rewrite on branch audit/security-public-readiness to:
- remove historical benchmark data/report blobs from history
- replace historical credential/local-path literals in remaining history
Restored a curated public-safe benchmark set:
- benchmarks/data/2026-02-23-postcleanup-fullsweep/combined_metrics.json
- benchmarks/data/2026-02-23-codex-effort-ab/combined_metrics.json
- benchmarks/data/2026-02-21-scenario-coverage-audit-v2.json
- benchmarks/reports/2026-02-23-postcleanup-fullsweep.md
- benchmarks/reports/2026-02-23-codex-effort-ab.md
Validation rerun after rewrite:
- scripts/ci/secret-detection.sh passed
- scripts/ci/validate_public_benchmarks.sh passed
- scripts/ci/validate_skills.py passed
- git fsck --full passed
