44
55<div align =" center " >
66
7+ <a href =" https://trendshift.io/repositories/15191 " target =" _blank " ><img src =" https://trendshift.io/api/badge/repositories/15191 " alt =" qualifire-dev%2Frogue | Trendshift " style =" width : 250px ; height : 55px ;" width =" 250 " height =" 55 " /></a >
8+
79![ Tests] ( https://github.com/qualifire-dev/rogue/actions/workflows/test.yml/badge.svg?branch=main )
810
911<img src =" ./freddy-rogue.png " width =" 200 " />
@@ -57,20 +59,20 @@ Simulate **adversarial attacks** to find security vulnerabilities.
5759
5860Rogue operates on a ** client-server architecture** with multiple interfaces:
5961
60- | Component | Description |
61- | ----------- | -------------|
62- | ** Server** | Core evaluation & red team logic |
63- | ** TUI** | Modern terminal interface (Go + Bubble Tea) |
64- | ** CLI** | Non-interactive mode for CI/CD pipelines |
62+ | Component | Description |
63+ | ---------- | ------------------------------------------- |
64+ | ** Server** | Core evaluation & red team logic |
65+ | ** TUI** | Modern terminal interface (Go + Bubble Tea) |
66+ | ** CLI** | Non-interactive mode for CI/CD pipelines |
6567
6668https://github.com/user-attachments/assets/b5c04772-6916-4aab-825b-6a7476d77787
6769
6870### Supported Protocols
6971
70- | Protocol | Transport | Description |
71- | ---------- | -----------| -------------|
72- | ** A2A** | HTTP | [ Google's Agent-to-Agent] ( https://a2a-protocol.org/latest/ ) protocol |
73- | ** MCP** | SSE, STREAMABLE_HTTP | [ Model Context Protocol] ( https://modelcontextprotocol.io/ ) via ` send_message ` tool |
72+ | Protocol | Transport | Description |
73+ | -------- | -- ------------------ | ---------------------------------------------------------------------------------- |
74+ | ** A2A** | HTTP | [ Google's Agent-to-Agent] ( https://a2a-protocol.org/latest/ ) protocol |
75+ | ** MCP** | SSE, STREAMABLE_HTTP | [ Model Context Protocol] ( https://modelcontextprotocol.io/ ) via ` send_message ` tool |
7476
7577See examples in [ ` examples/ ` ] ( ./examples/ ) for reference implementations.
7678
@@ -102,19 +104,20 @@ uvx rogue-ai --example=tshirt_store
102104```
103105
104106Configure in the UI:
107+
105108- ** Agent URL** : ` http://localhost:10001 `
106109- ** Mode** : Choose ` Automatic Evaluation ` or ` Red Teaming `
107110
108111---
109112
110113## Running Modes
111114
112- | Mode | Command | Description |
113- | ------| ---------| -------------|
114- | Default | ` uvx rogue-ai ` | Server + TUI |
115- | Server | ` uvx rogue-ai server ` | Backend only |
116- | TUI | ` uvx rogue-ai tui ` | Terminal client |
117- | CLI | ` uvx rogue-ai cli ` | Non-interactive (CI/CD) |
115+ | Mode | Command | Description |
116+ | ------- | --------------------- | ----------------------- |
117+ | Default | ` uvx rogue-ai ` | Server + TUI |
118+ | Server | ` uvx rogue-ai server ` | Backend only |
119+ | TUI | ` uvx rogue-ai tui ` | Terminal client |
120+ | CLI | ` uvx rogue-ai cli ` | Non-interactive (CI/CD) |
118121
119122### Server Options
120123
@@ -131,27 +134,27 @@ uvx rogue-ai cli \
131134 --business-context-file ./.rogue/business_context.md
132135```
133136
134- | Option | Description |
135- | --------| -------------|
136- | ` --config-file ` | Path to config JSON |
137- | ` --evaluated-agent-url ` | Agent endpoint (required) |
138- | ` --judge-llm ` | LLM for evaluation (required) |
139- | ` --business-context ` | Context string or ` --business-context-file ` |
140- | ` --input-scenarios-file ` | Scenarios JSON |
141- | ` --output-report-file ` | Report output path |
142- | ` --deep-test-mode ` | Extended testing |
137+ | Option | Description |
138+ | ------------------------ | ------------------------------------------- |
139+ | ` --config-file ` | Path to config JSON |
140+ | ` --evaluated-agent-url ` | Agent endpoint (required) |
141+ | ` --judge-llm ` | LLM for evaluation (required) |
142+ | ` --business-context ` | Context string or ` --business-context-file ` |
143+ | ` --input-scenarios-file ` | Scenarios JSON |
144+ | ` --output-report-file ` | Report output path |
145+ | ` --deep-test-mode ` | Extended testing |
143146
144147---
145148
146149## Red Teaming
147150
148151### Scan Types
149152
150- | Type | Vulnerabilities | Attacks | Time |
151- | ------| -----------------| ---------| ------|
152- | ** Basic** | 5 curated | 6 | ~ 2-3 min |
153- | ** Full** | 75+ | 40+ | ~ 30-45 min |
154- | ** Custom** | User-selected | User-selected | Varies |
153+ | Type | Vulnerabilities | Attacks | Time |
154+ | ---------- | --------------- | ------------- | ---------- |
155+ | ** Basic** | 5 curated | 6 | ~ 2-3 min |
156+ | ** Full** | 75+ | 40+ | ~ 30-45 min |
157+ | ** Custom** | User-selected | User-selected | Varies |
155158
156159### Compliance Frameworks
157160
@@ -165,17 +168,18 @@ uvx rogue-ai cli \
165168
166169### Attack Categories
167170
168- | Category | Examples |
169- | ----------| ----------|
170- | Encoding | Base64, ROT13, Leetspeak |
171- | Social Engineering | Roleplay, trust building |
172- | Injection | Prompt injection, SQL injection |
173- | Semantic | Goal redirection, context poisoning |
174- | Technical | Gray-box probing, permission escalation |
171+ | Category | Examples |
172+ | ------------------ | --------------------------------------- |
173+ | Encoding | Base64, ROT13, Leetspeak |
174+ | Social Engineering | Roleplay, trust building |
175+ | Injection | Prompt injection, SQL injection |
176+ | Semantic | Goal redirection, context poisoning |
177+ | Technical | Gray-box probing, permission escalation |
175178
176179### Risk Scoring (CVSS-based)
177180
178181Each vulnerability receives a ** 0-10 risk score** based on:
182+
179183- ** Impact** — Severity if exploited
180184- ** Exploitability** — Success rate likelihood
181185- ** Human Factor** — Manual exploitation potential
@@ -215,15 +219,15 @@ GOOGLE_API_KEY="..."
215219
216220## Key Features
217221
218- | Feature | Description |
219- | ---------| -------------|
220- | 🔄 Dynamic Scenarios | Auto-generate tests from business context |
221- | 👀 Live Monitoring | Watch agent conversations in real-time |
222- | 📊 Comprehensive Reports | Markdown, CSV, JSON exports |
222+ | Feature | Description |
223+ | ------------------------ | -------------------------------------------- |
224+ | 🔄 Dynamic Scenarios | Auto-generate tests from business context |
225+ | 👀 Live Monitoring | Watch agent conversations in real-time |
226+ | 📊 Comprehensive Reports | Markdown, CSV, JSON exports |
223227| 🔍 Multi-Faceted Testing | Policy compliance + security vulnerabilities |
224- | 🤖 Model Support | OpenAI, Anthropic, Google (via LiteLLM) |
225- | 🛡️ CVSS Scoring | Industry-standard risk assessment |
226- | 🔁 Reproducible | Deterministic scans with random seeds |
228+ | 🤖 Model Support | OpenAI, Anthropic, Google (via LiteLLM) |
229+ | 🛡️ CVSS Scoring | Industry-standard risk assessment |
230+ | 🔁 Reproducible | Deterministic scans with random seeds |
227231
228232---
229233
0 commit comments