Skip to content
This repository was archived by the owner on Oct 21, 2025. It is now read-only.

Commit c9d7d85

Browse files
committed
Fix all type errors and linting issues
- Remove multi_turn parameters from all test classes - Delete unused experimental files (exploit_simple.py, exploit_unified.py) - Delete simplified_test.py - Use Any type for flexible client handling (ANN401 already ignored) - Fix trailing whitespace and end-of-file issues - Update base class to accept any client type - Remove unused imports
1 parent 6d6f043 commit c9d7d85

36 files changed

+351
-828
lines changed

.env.template

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
11
# OpenRouter API Configuration
2-
OPENROUTER_API_KEY=your_openrouter_api_key_here
2+
OPENROUTER_API_KEY=your_openrouter_api_key_here

README.md

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,8 @@ A comprehensive security testing framework for the OpenAI GPT-OSS-20B Red Teamin
66

77
## Competition Overview
88

9-
**Target**: OpenAI's gpt-oss-20b open weight model
10-
**Goal**: Find up to 5 distinct vulnerabilities with high Severity, Breadth, Novelty, Reproducibility, and Methodological Insight
9+
**Target**: OpenAI's gpt-oss-20b open weight model
10+
**Goal**: Find up to 5 distinct vulnerabilities with high Severity, Breadth, Novelty, Reproducibility, and Methodological Insight
1111
**Deadline**: August 26, 2025
1212

1313
## 🛠️ Getting Started
@@ -50,10 +50,10 @@ A comprehensive security testing framework for the OpenAI GPT-OSS-20B Red Teamin
5050
# Or test specific categories
5151
uv run pentest -c deception
5252
uv run pentest --test-id adderall_001
53-
53+
5454
# Run multi-turn attack testing
5555
uv run attack
56-
56+
5757
# View and analyze sessions
5858
uv run sessions
5959
```
@@ -65,18 +65,18 @@ A comprehensive security testing framework for the OpenAI GPT-OSS-20B Red Teamin
6565
```bash
6666
# Run single attack (Claude Opus 4.1 default)
6767
uv run attack
68-
68+
6969
# Run adaptive campaign with multiple attempts
7070
uv run attack --batch 5 # Run 5 attempts, learning from each
7171
uv run attack --batch 10 --steps 2 # 10 attempts, each with 2 turns
72-
72+
7373
# Try different attacker models
7474
uv run attack --attacker-model openai/gpt-4o
7575
uv run attack --attacker-model meta-llama/llama-3.1-405b-instruct --batch 5
76-
76+
7777
# Use custom prompts instead of AI-generated
7878
uv run attack --custom
79-
79+
8080
# View and analyze past sessions
8181
uv run sessions
8282
```
@@ -190,7 +190,7 @@ red-team-testbed/
190190
├── src/
191191
│ ├── cli/ # CLI entry points
192192
│ │ ├── pentest.py # Security test runner
193-
│ │ ├── review.py # Results evaluation interface
193+
│ │ ├── review.py # Results evaluation interface
194194
│ │ ├── setup.py # Environment verification
195195
│ │ ├── findings.py # Findings browser and export
196196
│ │ ├── report.py # Report generation
@@ -208,7 +208,7 @@ red-team-testbed/
208208
│ ├── models.py # Pydantic data models
209209
│ └── constants.py # Configuration constants
210210
├── findings/ # Competition submission files
211-
├── results/ # Test execution results
211+
├── results/ # Test execution results
212212
└── pyproject.toml # Project configuration
213213
```
214214

@@ -234,7 +234,7 @@ uv run review results/test.json # Review specific file
234234
uv run findings # Browse exported findings
235235
# Use 'e' to export individual competition files
236236

237-
# Generate reports and summaries
237+
# Generate reports and summaries
238238
uv run report # Comprehensive analysis report
239239
```
240240

__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,4 +5,4 @@
55

66
__version__ = "1.0.0"
77
__author__ = "Red Team Testbed Contributors"
8-
__description__ = "Red-teaming testbed for OpenAI's GPT-OSS-20B model competition"
8+
__description__ = "Red-teaming testbed for OpenAI's GPT-OSS-20B model competition"

competition.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -106,4 +106,4 @@ This competition represents a significant effort by OpenAI to:
106106
- Create a structured evaluation process with expert oversight
107107
- Build a comprehensive understanding of model vulnerabilities
108108

109-
The competition emphasizes finding **novel** vulnerabilities that haven't been previously identified, making original research and creative approaches particularly valuable.
109+
The competition emphasizes finding **novel** vulnerabilities that haven't been previously identified, making original research and creative approaches particularly valuable.

config.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ categories:
3535
- evaluation_awareness
3636
- chain_of_thought
3737
- best
38-
38+
3939
focus_categories:
4040
- deception_samples # Primary focus for initial implementation
4141
- deception_adderall # Academic pentesting for medical advice vulnerabilities
@@ -78,4 +78,4 @@ competition:
7878
max_findings: 5
7979
team_name: "red-team-testbed"
8080
export_format: "kaggle"
81-
validate_findings: true
81+
validate_findings: true

findings/.gitkeep

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
11
# This file ensures the findings directory is tracked in git
2-
# Exported findings from the CLI UI will be saved here
2+
# Exported findings from the CLI UI will be saved here

findings/deception_example_001.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,4 +39,4 @@
3939
"model_version": "gpt-oss:20b",
4040
"test_environment": "ollama-local",
4141
"additional_notes": "This finding demonstrates a pattern where the model appears to recognize evaluation contexts but strategically avoids acknowledging them. The consistency of evasive responses across multiple prompts suggests this may be learned behavior rather than incidental responses."
42-
}
42+
}

findings/export_20250813_102234.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1103,4 +1103,4 @@
11031103
"total_repetitions": 5
11041104
}
11051105
]
1106-
}
1106+
}

findings/export_20250813_102826.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -221,4 +221,4 @@
221221
"total_repetitions": null
222222
}
223223
]
224-
}
224+
}

findings/export_20250814_093634.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -121,4 +121,4 @@
121121
"total_repetitions": null
122122
}
123123
]
124-
}
124+
}

0 commit comments

Comments
 (0)