Improve verify-tests-fail-without-fix Skill #33513

kubaflo · 2026-01-13T16:55:48Z

Note

Are you waiting for the changes in this PR to be merged?
It would be very helpful if you could test the resulting artifacts from this PR and let us know in a comment if this change resolves your issue. Thank you!

Description

This PR improves the verify-tests-fail-without-fix skill by fixing fork repository detection, improving error handling, and making full verification the recommended/default behavior.

Changes

1. Added `git fetch origin` for Accurate Diffs

Fetches the base branch from origin before computing git diff
Ensures accurate fix file detection even in fork repositories
Prevents false negatives when base branch is stale locally

2. Improved Verification Mode Behavior

Made full verification the primary/default mode - tests must FAIL without fix and PASS with fix
Added -RequireFullVerification parameter - prevents silent fallback to verify-only mode
Improved error handling - script now warns/errors if no fix files detected (instead of silently switching modes)
Verify-only mode still available - runs when no fix files detected AND -RequireFullVerification not set
Clearer expectations - user explicitly controls whether verify-only fallback is allowed

3. Improved Documentation

Updated SKILL.md to reflect improved mode behavior
Clarified requirements (fix files + test files for full verification)
Updated mode selection table to show both modes clearly
Added troubleshooting guidance for "no fix files detected" scenario

4. Better Error Handling

Script exits early with actionable error if no fix files found AND -RequireFullVerification set
Provides explicit solutions (use -FixFiles or -BaseBranch explicitly)
Reduced ambiguity - user knows exactly what mode is running

Why These Changes?

Problem 1: Fork Repository Detection

When working in a fork, the base branch might not be up-to-date locally, causing git diff to produce inaccurate results. This led to:

Missing fix files in the diff
False "only test files" mode activation
Incorrect verification results

Solution: Always fetch the base branch from origin before computing diff.

Problem 2: Confusing Auto-Detection

The previous auto-detection system was unclear:

Silent fallback to "verify failure only" when no fix files detected
Users didn't know which mode was running
Made debugging harder

Solution:

Add -RequireFullVerification parameter to make expectations explicit
Error immediately if user expects full verification but no fix files found
Maintain verify-only mode for legitimate test-first workflows

Problem 3: Silent Failures

When no fix files were detected (due to stale base branch or wrong base detection), the script would silently fall back to "verify failure only" mode, which wasn't the intended behavior.

Solution: Fail fast with clear error message and troubleshooting steps when -RequireFullVerification is set.

Testing

Verified the changes work correctly with:

✅ Fork repositories (fetch ensures accurate base branch)
✅ PRs with fix files + test files (full verification runs)
✅ Clear error when no fix files detected with -RequireFullVerification
✅ Verify-only mode still works when appropriate (without the flag)
✅ -RequireFullVerification parameter enforces strict mode

Example Usage

# Recommended for PR validation - ensures full verification
pwsh .github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1 -Platform android -RequireFullVerification

# With explicit test filter
pwsh .github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1 -Platform ios -TestFilter "Issue33356" -RequireFullVerification

# Override fix files or base branch if auto-detection fails
pwsh .github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1 -Platform android -FixFiles @("src/Core/File.cs") -BaseBranch "main"

# Test-first workflow (verify tests fail before writing fix)
pwsh .github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1 -Platform ios

Breaking Changes

⚠️ Behavior Change: The skill now requires explicit intent when fix files are not detected:

Without -RequireFullVerification: Falls back to verify-only mode (tests only need to FAIL)
With -RequireFullVerification: Errors immediately if no fix files found

Recommended usage: Always use -RequireFullVerification when validating PRs to ensure full verification (tests FAIL without fix, PASS with fix).

Migration: Update any automated scripts that call this skill to add -RequireFullVerification for PR validation workflows.

Why This Correction Matters

Code Reality: The implementation shows verify-only mode still exists at lines 173-193:

if ($DetectedFixFiles.Count -eq 0) {
    Write-Host "║         VERIFY FAILURE ONLY MODE                          ║"
    # ... mode implementation
}

Impact of Inaccuracy:

Future maintainers reading "removed" will be confused when they see the mode in code
Users might think they can't use verify-only mode for test-first workflows
Documentation doesn't match reality

Correction Approach:

Acknowledge mode still exists
Explain it's no longer the silent fallback (key improvement)
Show it's still useful for test-first development
Emphasize full verification is now recommended/default

…re full verification

dotnet-policy-service · 2026-01-13T16:56:10Z

Hey there @@kubaflo! Thank you so much for your PR! Someone from the team will get assigned to your PR shortly and we'll get it reviewed.

Copilot

Pull request overview

This PR modifies the verify-tests-fail-without-fix skill to simplify its behavior and improve fork repository detection. The main changes include:

Changes:

Removed the "verify failure only" mode (when only test files changed)
Added git fetch origin to ensure base branch is up-to-date before computing diffs
Added -RequireFullVerification parameter to enforce full verification mode
Simplified documentation to focus on single-mode behavior (full verification only)

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
`.github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1`	Removed ~100 lines of "verify failure only" mode logic, added git fetch for base branch, added RequireFullVerification parameter and error handling
`.github/skills/verify-tests-fail-without-fix/SKILL.md`	Updated documentation to remove references to dual-mode behavior and auto-detection table, simplified to describe only full verification mode

.github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1

.github/skills/verify-tests-fail-without-fix/SKILL.md

.github/skills/verify-tests-fail-without-fix/scripts/verify-tests-fail.ps1

.github/skills/verify-tests-fail-without-fix/SKILL.md

Enhanced verify-tests-fail.ps1 to support a mode where, if no fix files are detected, it auto-detects changed test files, runs only the relevant tests, and expects them to fail. This mode helps validate that new or updated tests correctly reproduce a bug before a fix is implemented, providing detailed output and guidance for users.

Introduces $BaseBranchRef to consistently reference the latest base branch, whether local or origin, for all git diff, ls-tree, and checkout operations. This improves reliability when the local branch is stale or fetch operations fail, and simplifies the logic for detecting changed files and reverting fixes.

Expanded documentation to describe two workflow modes: 'Verify Failure Only' for test creation and 'Full Verification' for validating both tests and fixes. Added usage examples, clarified requirements, and detailed expected outputs for each mode to improve clarity for contributors.

Added the -RequireFullVerification flag to all usage examples for consistency and clarified its purpose in the documentation.

Enhanced the verify-tests-fail.ps1 script to support two modes: verify failure only (when no fix files are detected) and full verification (when fix files are present). Updated documentation to clarify usage, parameters, and examples for both modes, improving usability and clarity.

Adds the -RequireFullVerification flag to the PowerShell test verification command in the PR documentation to ensure full verification mode is used.

Improve verify-tests-fail skill: add fetch, fix fork detection, requi…

57f8cc6

…re full verification

Copilot AI review requested due to automatic review settings January 13, 2026 16:55

dotnet-policy-service bot added the community ✨ Community Contribution label Jan 13, 2026

kubaflo self-assigned this Jan 13, 2026

kubaflo assigned PureWeen Jan 13, 2026

kubaflo added the area-ai label Jan 13, 2026

Copilot started reviewing on behalf of kubaflo January 13, 2026 16:56 View session

Copilot AI reviewed Jan 13, 2026

View reviewed changes

kubaflo added 6 commits January 13, 2026 23:52

Update usage examples for verify-tests-fail script

8f71e28

Added the -RequireFullVerification flag to all usage examples for consistency and clarified its purpose in the documentation.

Update test verification command in PR guide

ed832e5

Adds the -RequireFullVerification flag to the PowerShell test verification command in the PR documentation to ensure full verification mode is used.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve verify-tests-fail-without-fix Skill #33513

Improve verify-tests-fail-without-fix Skill #33513

kubaflo commented Jan 13, 2026 •

edited

Loading

Uh oh!

dotnet-policy-service bot commented Jan 13, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Improve verify-tests-fail-without-fix Skill #33513

Are you sure you want to change the base?

Improve verify-tests-fail-without-fix Skill #33513

Conversation

kubaflo commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes

1. Added git fetch origin for Accurate Diffs

2. Improved Verification Mode Behavior

3. Improved Documentation

4. Better Error Handling

Why These Changes?

Problem 1: Fork Repository Detection

Problem 2: Confusing Auto-Detection

Problem 3: Silent Failures

Testing

Example Usage

Breaking Changes

Related

Change 2: Breaking Changes Section

Change 3: Added Test-First Workflow Example

Why This Correction Matters

Uh oh!

dotnet-policy-service bot commented Jan 13, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kubaflo commented Jan 13, 2026 •

edited

Loading

1. Added `git fetch origin` for Accurate Diffs