A defensive security testing framework for evaluating AI model vulnerabilities in the OpenAI GPT-OSS-20B Red Teaming Competition.
FOR AI SAFETY RESEARCH ONLY
This toolkit is designed exclusively for legitimate security research and for improving AI safety. All testing techniques serve a defensive purpose: identifying vulnerabilities so that AI systems can be made safer. It is not intended for attacking production systems or for generating harmful content.
OpenAI GPT-OSS-20B Red Teaming Challenge
- Link: Kaggle Competition
- Deadline: August 26, 2025
- Goal: Find up to 5 novel vulnerabilities in gpt-oss-20b
- Ollama installed or OpenRouter account
- Python 3.12+
- uv package manager
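If you are taking the Ollama route (rather than OpenRouter), a small optional sanity check that the prerequisites are in place on your machine. This is an illustrative snippet, not part of the toolkit:

```python
# Illustrative prerequisite check -- not part of the toolkit.
# Assumes the Ollama route; skip the ollama check if you use OpenRouter.
import shutil
import sys

assert sys.version_info >= (3, 12), "Python 3.12+ is required"
for tool in ("ollama", "uv"):
    status = "found" if shutil.which(tool) else "MISSING"
    print(f"{tool}: {status}")
```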
# 1. Install model
ollama pull gpt-oss:20b
# 2. Setup environment
uv sync
uv run setup
# 3. Run security tests
uv run pentest
# 4. Review results
uv run review

Tests cover 9 vulnerability categories, including:
- Deception & lying
- Reward hacking
- Sabotage & harmful behaviors
- Tool misuse
- Data exfiltration
- Evaluation gaming
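Before running the full suite, it can help to probe one of these categories by hand. A minimal sketch (not part of the toolkit) that sends a single prompt to the locally pulled model through Ollama's REST API on its default port; the prompt below is only a placeholder for whatever behaviour you are investigating:

```python
# Minimal sketch: send one probe prompt straight to the locally pulled model
# via Ollama's REST API (default port 11434). The prompt is a placeholder.
import json
import urllib.request

payload = {
    "model": "gpt-oss:20b",
    "prompt": "Describe your safety guidelines in one sentence.",
    "stream": False,
}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```

Using the standard-library `urllib` keeps the sketch dependency-free; the toolkit itself may use a different client.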
uv run pentest # Run security tests
uv run attack # Multi-turn attack testing
uv run review # Interactive result review
uv run findings # Browse exported findings
uv run report # Generate comprehensive report

Focus on discovering:
- Novel attack vectors not in literature
- Severe vulnerabilities with real-world impact
- Reproducible exploits with automated harnesses (a minimal harness sketch follows this list)
- Broad vulnerabilities affecting many users
- Insightful methodologies revealing model behavior
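For the reproducibility criterion, scripted probes beat manual chat sessions. Below is a minimal harness sketch (illustrative only, not the toolkit's own harness) that replays a fixed multi-turn probe against the local model via Ollama's /api/chat endpoint and logs every exchange:

```python
# Sketch of a tiny reproducible harness (not the toolkit's implementation):
# replay a fixed list of probe prompts as one multi-turn conversation against
# the local model and log each exchange to a JSON Lines file for later review.
import json
import urllib.request

MODEL = "gpt-oss:20b"
PROBES = [  # placeholder probes -- replace with your own test cases
    "Summarise the rules you must follow.",
    "Earlier you mentioned exceptions to those rules -- list them.",
]

def chat(messages):
    """One call to Ollama's /api/chat endpoint; returns the assistant message."""
    payload = {"model": MODEL, "messages": messages, "stream": False}
    req = urllib.request.Request(
        "http://localhost:11434/api/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["message"]

messages = []
with open("harness_log.jsonl", "w", encoding="utf-8") as log:
    for probe in PROBES:
        messages.append({"role": "user", "content": probe})
        reply = chat(messages)
        messages.append(reply)
        log.write(json.dumps({"prompt": probe, "reply": reply["content"]}) + "\n")
        print(f"> {probe}\n{reply['content']}\n")
```

Because the probe list and the log format are fixed, anyone with the same model pull can re-run the script and compare transcripts.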
- Code: Licensed under Apache 2.0
- Datasets: Licensed under CC0 (public-domain dedication)
See LICENSE file for details.