Skip to content

Commit 6eb6c03

Browse files
committed
Update behavioral trigger:
1 parent 3f5d595 commit 6eb6c03

2 files changed

Lines changed: 6 additions & 6 deletions

File tree

.github/workflows/behavioral.yml

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ name: behavioral
99
# .github/scripts/select_behavioral.py.
1010
# * label-gated execution -- the test job (which holds ANTHROPIC_API_KEY)
1111
# only runs on manual dispatch or when a maintainer adds the
12-
# `run-behavioral` label, keeping the secret away from untrusted / fork
12+
# `run_behavioral` label, keeping the secret away from untrusted / fork
1313
# code that runs with tool permissions bypassed.
1414
# * required when relevant -- when a PR changes a skill or test that maps to a
1515
# behavioral test, the `behavioral` gate FAILS until the label is added and
@@ -45,7 +45,7 @@ permissions:
4545
contents: read
4646

4747
env:
48-
BEHAVIORAL_LABEL: run-behavioral
48+
BEHAVIORAL_LABEL: run_behavioral
4949

5050
jobs:
5151
# Decide which skills the change affects. This is secret-free (just git diff +
@@ -96,12 +96,12 @@ jobs:
9696
name: Behavioral (${{ matrix.skill }})
9797
needs: discover
9898
# Run only when something testable changed AND the run is authorized:
99-
# manual dispatch, or a maintainer added the `run-behavioral` label. This is
99+
# manual dispatch, or a maintainer added the `run_behavioral` label. This is
100100
# the gate that protects the ANTHROPIC_API_KEY secret.
101101
if: >-
102102
needs.discover.outputs.any == 'true' &&
103103
(github.event_name == 'workflow_dispatch' ||
104-
contains(github.event.pull_request.labels.*.name, 'run-behavioral'))
104+
contains(github.event.pull_request.labels.*.name, 'run_behavioral'))
105105
runs-on: ubuntu-latest
106106
# Behavioral runs install local models and can take a while; cap it so a
107107
# hung agent or stalled model pull fails the job instead of burning minutes.
@@ -161,7 +161,7 @@ jobs:
161161
AFFECTED: ${{ needs.discover.outputs.any }}
162162
SKILLS: ${{ needs.discover.outputs.skills }}
163163
# 'true' only on a PR that carries the label; '' / 'false' otherwise.
164-
LABEL_PRESENT: ${{ contains(github.event.pull_request.labels.*.name, 'run-behavioral') }}
164+
LABEL_PRESENT: ${{ contains(github.event.pull_request.labels.*.name, 'run_behavioral') }}
165165
steps:
166166
- name: Verify behavioral results
167167
run: |

CONTRIBUTING.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -222,7 +222,7 @@ cd eval/behavioral && pytest tests/test_local_ai_use.py
222222
```
223223

224224
In CI, the `behavioral` workflow runs these tests, but **only** when a
225-
maintainer adds the `run-behavioral` label to a PR for safety.
225+
maintainer adds the `run_behavioral` label to a PR for safety.
226226

227227
## Pre-publish checklist
228228

0 commit comments

Comments
 (0)