Watch an agent process and SIGKILL it on policy violation. Userspace warden that scores file, network, and subprocess behavior against a YAML policy and stops the agent at the action that's about to break things — not the postmortem an hour later.
If your agent passed every static check and then deleted 40 files in 8 seconds, this is the watcher that would have stopped it at file 4. The static gate was never going to catch a runtime decision.
- Your agent ran `rm -rf` outside `/tmp` at 2am and you found out from the morning standup. The audit log was perfect; it just wasn't going to wake anyone up.
- You added an LLM judge in front of every shell call. It's 800ms per action, doubles your cost, and still missed the 200ms read of `~/.ssh/id_rsa` because the judge isn't on the file-system event path.
- You tried `--agent-name my-agent` once. It matched three unrelated processes, including your editor. PID-target your enforcement or don't bother.
- Your "policy YAML" is an aspirational doc, not something a process is enforcing. A policy without an enforcer is a memo.
- You're treating runtime safety as a layer you'll add "after MVP." MVP shipped; the agent has shell access; the layer didn't.
```bash
pip install suy-sideguy
```

Requires Python 3.9+.
```bash
suy-warden --scope examples/scope.generic.yaml --agent-pid 12345 --poll 0.5
```

Live output while the agent runs:
```text
target=12345 verdict=SAFE action=continue
target=12345 verdict=FLAG action=log_continue reason=high_fd_count
target=12345 verdict=HALT action=freeze reason=mass_deletion_3_in_10s
```
After a run:
```bash
suy-forensic-report --last-hours 24
```

Use suy-sideguy when you run autonomous or semi-autonomous agents and need userspace runtime containment, policy enforcement, and forensic evidence — as one layer in a defense-in-depth setup.
- Not a kernel-level sandbox. If your threat model requires kernel isolation, use a kernel sandbox.
- Not a substitute for input-side prompt-injection defenses.
- Not proof that any single policy file covers every workload safely. Policies need calibration on each workload.
- Watches process, file, and network behavior for an agent process
- Applies policy rules (optionally with a local LLM judge via Ollama)
- HALTs suspicious actions (freeze + alert) before they escalate
- KILLs severe violations with `SIGKILL` when policy requires it
- Stores evidence and can generate incident-ready forensic reports
- It is not kernel-level enforcement (it runs in userspace)
- File visibility via `psutil.open_files()` is best-effort and OS-dependent
- Network checks are based on observed remote IP/port; domain matching can be lossy after DNS resolution
```bash
pip install suy-sideguy
```

Requires Python 3.9+.
For development:

```bash
git clone https://github.com/hermes-labs-ai/suy-sideguy.git
cd suy-sideguy
pip install -e ".[dev]"
```

Use one of:
- `--agent-pid` (recommended for production)
- `--agent-name` (convenient, but can match unintended processes)
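Before pointing enforcement at a PID, confirm it is the process you think it is. A Linux-only stdlib sketch (suy-sideguy itself inspects processes via psutil; `describe_pid` here is an illustrative helper, not part of the tool):

```python
import os

def describe_pid(pid: int) -> str:
    """Return the full command line for `pid`, or raise if it doesn't exist.

    Linux-only illustration: reads /proc directly. suy-sideguy itself
    uses psutil, which is portable across OSes.
    """
    with open(f"/proc/{pid}/cmdline", "rb") as f:
        # argv entries are NUL-separated in /proc/<pid>/cmdline
        return f.read().replace(b"\x00", b" ").decode().strip()

# Sanity-check the target before enforcing against it:
print(describe_pid(os.getpid()))
```

If the command line is not the agent you expected, fix the PID before starting the warden; killing the wrong process is worse than no enforcement.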
- Open `examples/scope.openclaw.yaml`
- For staged rollout, start with `examples/scope.low-disruption.yaml`
- Narrow allowlists to only what your workload truly needs
- For a generic baseline, start with `examples/scope.generic.yaml`
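As an orientation aid only, a minimal scope file might look like the sketch below. The `deny_write`, `flag_threshold`, and `flag_window` names appear elsewhere in this README; the remaining keys (`mode`, `allow_write`) are hypothetical. Treat the shipped `examples/scope.generic.yaml` as the source of truth for the real schema.

```yaml
# Hypothetical sketch -- defer to examples/scope.generic.yaml for the real schema
mode: audit            # hypothetical key: audit -> confirm -> enforce rollout
allow_write:           # hypothetical key: where the agent may write
  - /tmp
  - ./workspace
deny_write:            # name from this README: forbidden write paths
  - ~/.ssh
  - ~/.openclaw
flag_threshold: 5      # name from this README: FLAGs before escalation
flag_window: 60        # name from this README: seconds of FLAG accumulation
```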
```bash
# Safer targeting: PID
suy-warden --scope examples/scope.generic.yaml --agent-pid 12345 --poll 0.5

# Convenience targeting: process name
suy-warden --scope examples/scope.generic.yaml --agent-name my-agent --poll 0.5
```

Generate a forensic report:

```bash
suy-forensic-report --last-hours 24
```

Default paths:

- `~/.local/share/sysmond/warden.log`
- `~/.local/share/sysmond/logs/actions_*.jsonl`
- `~/.local/share/sysmond/logs/incidents/*.json`
Tip: treat these as security artifacts. Protect access and define retention/rotation.
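The warden's live lines are flat `key=value` pairs, which makes them easy to post-process into alerts or dashboards. A minimal stdlib parser sketch (the fields shown match the sample output in this README; real lines may carry additional fields):

```python
def parse_warden_line(line: str) -> dict:
    """Split a flat key=value warden log line into a dict of strings."""
    fields = {}
    for token in line.split():
        if "=" in token:
            key, _, value = token.partition("=")
            fields[key] = value
    return fields

line = "target=12345 verdict=HALT action=freeze reason=mass_deletion_3_in_10s"
event = parse_warden_line(line)
# event["verdict"] == "HALT", event["reason"] == "mass_deletion_3_in_10s"
```

A one-liner like this is enough to route `HALT`/`KILL` events into whatever paging system you already run.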
- Audit-first: observe and tune policy without auto-kill pressure.
- Confirm mode for ambiguous events.
- Enforce mode only after your hard invariants are validated.
- `SIGKILL` is immediate and can interrupt legitimate work if policy is too broad.
- Name matching (`--agent-name`) can over-match; prefer PID targeting in production.
- This project should be one part of a layered defense strategy.
| Verdict | Meaning | Response |
|---|---|---|
| `SAFE` | Action is within policy | Continue |
| `FLAG` | Suspicious, below threshold | Log, continue |
| `HALT` | Dangerous pattern detected | Freeze, alert operator |
| `KILL` | Critical violation | SIGKILL immediately |
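In userspace, the four verdicts ultimately reduce to signals. The sketch below shows one plausible dispatch, using `SIGSTOP` to freeze and `SIGKILL` to terminate; the function and its shape are illustrative, not suy-sideguy's actual API:

```python
import os
import signal

def respond(verdict: str, pid: int, dry_run: bool = True) -> str:
    """Map a verdict to a process action. Illustrative sketch only."""
    if verdict == "SAFE":
        return "continue"
    if verdict == "FLAG":
        return "log_continue"             # record it, keep running
    if verdict == "HALT":
        if not dry_run:
            os.kill(pid, signal.SIGSTOP)  # freeze so an operator can inspect
        return "freeze"
    if verdict == "KILL":
        if not dry_run:
            os.kill(pid, signal.SIGKILL)  # immediate, non-catchable stop
        return "kill"
    raise ValueError(f"unknown verdict: {verdict}")
```

A frozen (`SIGSTOP`ped) process keeps its memory and file descriptors intact, which is what makes the freeze-then-inspect response more forensically useful than an immediate kill.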
- 3+ file deletions in 10 seconds — mass deletion pattern
- curl or wget spawned — unexpected outbound data transfer
- 50+ network calls in 60 seconds — bulk messaging / API spam
- Write outside allowed workspace — unexpected filesystem access
- SSH key access — any read/write to `~/.ssh/` or `*id_rsa*`, `*id_ed25519*`
- Config tampering — writing to `~/.openclaw/openclaw.json`
- rm -rf on non-tmp paths — destructive sweep outside `/tmp`
- Forbidden paths — any path in your scope's `deny_write` list
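Rules like "3+ file deletions in 10 seconds" are classic sliding-window checks. A self-contained sketch of that one rule, with the thresholds taken from the list above (an illustration of the technique, not the shipped detector):

```python
from collections import deque

class MassDeletionRule:
    """Escalate when `limit` deletions land inside a `window_s`-second window."""

    def __init__(self, limit: int = 3, window_s: float = 10.0):
        self.limit = limit
        self.window_s = window_s
        self.times: deque[float] = deque()

    def observe(self, ts: float) -> str:
        # Drop deletions that have aged out of the window.
        while self.times and ts - self.times[0] > self.window_s:
            self.times.popleft()
        self.times.append(ts)
        return "HALT" if len(self.times) >= self.limit else "SAFE"

rule = MassDeletionRule()
verdicts = [rule.observe(t) for t in (0.0, 2.0, 4.0)]  # 3 deletions in 4s
# verdicts[-1] == "HALT"
```

The same window-and-count shape covers the network rule (50+ calls in 60 seconds) with different parameters.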
Early flag noise is normal during policy calibration on real workloads.
- Treat early `FLAG` events as calibration data, not immediate defects.
- Use policy thresholds (`flag_threshold`, `flag_window`) to control when accumulated risk escalates to a kill.
- Keep hard invariants (e.g., forbidden secrets paths / destructive commands) as immediate stop conditions.
- Start in audit-first mode, then tighten only after reviewing forensic logs.
Current status based on repository checks and CI configuration; not a formal security certification.
- ✅ Tests in repo (`pytest`)
- ✅ Package buildable (`python -m build`)
- ✅ CI workflow (`.github/workflows/ci.yml`)
- ✅ Publish workflow (`.github/workflows/publish.yml`)
- ✅ Security disclosure policy (`SECURITY.md`)
If suy-sideguy saves you time, please star the repo — it helps others find it.
Hermes Labs builds AI audit infrastructure for enterprise AI systems — EU AI Act readiness, ISO 42001 evidence bundles, continuous compliance monitoring, agent-level risk testing. We work with teams shipping AI into regulated environments.
Our OSS philosophy — read this if you're deciding whether to depend on us:
- Everything we release is free, forever. MIT or Apache-2.0. No "open core," no SaaS tier upsell, no paid version with the features you actually need. You can run this repo commercially, without talking to us.
- We open-source our own infrastructure. The tools we release are what Hermes Labs uses internally — we don't publish demo code, we publish production code.
- We sell audit work, not licenses. If you want an ANNEX-IV pack, an ISO 42001 evidence bundle, gap analysis against the EU AI Act, or agent-level red-teaming delivered as a report, that's at hermes-labs.ai. If you just want the code to run it yourself, it's right here.
The Hermes Labs OSS audit stack (public, production-grade, no SaaS):
Static audit (before deployment)
- lintlang — Static linter for AI agent configs, tool descriptions, and system prompts. `pip install lintlang`
- rule-audit — Static prompt audit: contradictions, coverage gaps, priority ambiguities
- scaffold-lint — Scaffold budget + technique stacking. `pip install scaffold-lint`
- intent-verify — Repo intent verification + spec-drift checks
Runtime observability (while the agent runs)
- little-canary — Prompt injection detection via sacrificial canary-model probes
- colony-probe — Prompt confidentiality audit — detects system-prompt reconstruction
Regression & scoring (to prove what changed)
- hermes-jailbench — Jailbreak regression benchmark. `pip install hermes-jailbench`
- agent-convergence-scorer — Score how similar N agent outputs are. `pip install agent-convergence-scorer`
Supporting infra
Natural pairing: suy-sideguy is the runtime-containment chapter. Pair with lintlang (pre-deployment static gate) and little-canary (input-side injection detection) for defense in depth.
```bash
pip install -e ".[dev]"
pytest
```

Also see:

- `CONTRIBUTING.md`
- `SECURITY.md`
- `PUBLISH_CHECKLIST.md`
- `AGENTS.md`
- `CODE_OF_CONDUCT.md`
- Audit checklist: `docs/AUDIT_CHECKLIST.md`
- Layered plan: `docs/IMPLEMENTATION_PLAN_LAYERED.md`
