Dangerous mode: opt-in guardrail bypass for autonomous execution #559
Open
Labels
agent
domain:automation (Scheduler, autonomy, RAG, web search, watchers, research)
enhancement (New feature or request)
p1 (medium priority)
security (Security-sensitive changes)
track:consumer-app (Hermes-competitor consumer product: mobile-first, voice + messaging + memory + skills)
Summary
Add an opt-in dangerous mode that completely disables all guardrails — no confirmation prompts, no action tier restrictions. The agent executes everything immediately without asking. Disabled by default.
Parent issue: #555 (Autonomous mode)
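A minimal sketch of the behavior the summary describes, not the project's actual implementation. The `Settings` class, the `requires_confirmation` helper, and the tier names are all hypothetical placeholders chosen for illustration. The default policy for non-dangerous mode is also an assumption.

```python
from dataclasses import dataclass
from enum import Enum


class Tier(Enum):
    # Placeholder tiers for illustration only; not the project's real tier names.
    READ_ONLY = 1
    REVERSIBLE = 2
    DESTRUCTIVE = 3


@dataclass(frozen=True)
class Settings:
    # Hypothetical settings object; dangerous mode is opt-in, so it defaults to False.
    dangerous_mode: bool = False


def requires_confirmation(tier: Tier, settings: Settings) -> bool:
    """Return True when the agent should ask the user before running an action."""
    if settings.dangerous_mode:
        # Dangerous mode bypasses every guardrail, regardless of tier.
        return False
    # Assumed default policy: anything beyond read-only needs approval.
    return tier is not Tier.READ_ONLY
```

With the default `Settings()`, a destructive action still prompts. Only an explicit `Settings(dangerous_mode=True)` removes the prompt.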
Use Cases
UI
gaia chat --autonomous --dangerous
dangerous_mode: true in agent config
Dangerous + Autonomous Double Confirmation
Enabling dangerous mode while autonomous mode is on triggers an additional confirmation:
"Dangerous mode with autonomous execution means the agent can take destructive actions without your approval, even when you are not watching. Are you sure?"
This combination requires explicit acknowledgment because the agent may act between sessions with no user present.
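The additional confirmation could be gated roughly as follows. This is a sketch under assumptions: `allow_dangerous_autonomous` and the `confirm` callback are hypothetical names, and only the extra prompt for the combined case is modeled. Any confirmation for dangerous mode on its own is out of scope here.

```python
from typing import Callable

# Warning text quoted from the issue.
WARNING = (
    "Dangerous mode with autonomous execution means the agent can take "
    "destructive actions without your approval, even when you are not "
    "watching. Are you sure?"
)


def allow_dangerous_autonomous(
    dangerous: bool,
    autonomous: bool,
    confirm: Callable[[str], bool],
) -> bool:
    """Return True if the requested mode combination may proceed.

    `confirm` is a hypothetical callback that shows a message to the user
    and returns True only on explicit acknowledgment.
    """
    if not (dangerous and autonomous):
        # The extra prompt applies only when both modes are requested together.
        return True
    return confirm(WARNING)
```

Passing the prompt as a callback keeps the check testable and lets a CLI or a mobile UI supply its own dialog.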
Standard Guardrail Tiers (for reference — dangerous mode skips all of these)
Acceptance Criteria
--dangerous flag