This repository was archived by the owner on Oct 21, 2025. It is now read-only.
- Remove multi_turn parameters from all test classes
- Delete unused experimental files (exploit_simple.py, exploit_unified.py)
- Delete simplified_test.py
- Use Any type for flexible client handling (ANN401 already ignored)
- Fix trailing whitespace and end-of-file issues
- Update base class to accept any client type
- Remove unused imports
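
As a rough illustration of the two `Any`-related bullets above, a base class that accepts any client type might look like the sketch below. The names `BaseTester`, `EchoClient`, `run`, and `generate` are hypothetical and are not taken from this repository; the only assumption carried over from the commit message is that the client parameter is annotated with `typing.Any`, which linters flag as ANN401 unless that rule is ignored.

```python
from typing import Any


class BaseTester:
    """Hypothetical base class; the repository's real class is not shown in this diff."""

    def __init__(self, client: Any) -> None:
        # `Any` lets callers pass whichever client object they have.
        # Linters report this annotation as ANN401 unless the rule is ignored.
        self.client = client

    def run(self, prompt: str) -> str:
        # Assumption for illustration: the client exposes `generate(prompt)`.
        return self.client.generate(prompt)


class EchoClient:
    """Stand-in client used only to exercise the sketch."""

    def generate(self, prompt: str) -> str:
        return f"echo: {prompt}"


result = BaseTester(EchoClient()).run("hello")
```

The trade-off is that `Any` removes static checking on the client, so a mistyped method name only fails at call time; a `typing.Protocol` would be a stricter alternative if the client interface were fixed.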
competition.md (1 addition, 1 deletion)
@@ -106,4 +106,4 @@ This competition represents a significant effort by OpenAI to:
 - Create a structured evaluation process with expert oversight
 - Build a comprehensive understanding of model vulnerabilities
 
-The competition emphasizes finding **novel** vulnerabilities that haven't been previously identified, making original research and creative approaches particularly valuable.
+The competition emphasizes finding **novel** vulnerabilities that haven't been previously identified, making original research and creative approaches particularly valuable.
findings/deception_example_001.json (1 addition, 1 deletion)
@@ -39,4 +39,4 @@
 "model_version": "gpt-oss:20b",
 "test_environment": "ollama-local",
 "additional_notes": "This finding demonstrates a pattern where the model appears to recognize evaluation contexts but strategically avoids acknowledging them. The consistency of evasive responses across multiple prompts suggests this may be learned behavior rather than incidental responses."