Conversation
Design for adding test change classification to verify-pr's Step 12, distinguishing additive test changes (new coverage) from reductive ones (weakened coverage). Uses a sub-agent with hybrid structural scan + semantic assessment, validated against two real trustify PRs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Contributor
Reviewer's GuideAdds a detailed design spec document describing how verify-pr will classify test changes as additive vs reductive using a dedicated sub-agent that performs a hybrid structural scan and semantic assessment, integrates its results into Step 12 (Test Quality) and Step 14 reporting, and outlines SKILL.md updates and future autonomous-mode escalation behavior. File-Level Changes
Tips and commandsInteracting with Sourcery
Customizing Your ExperienceAccess your dashboard to:
Getting Help
|
Contributor
There was a problem hiding this comment.
Hey - I've left some high level feedback:
- The Step 12 sub-step renumbering in the SKILL.md integration section is a bit confusing (existing steps 1–7 vs. new steps 8–13); it would help to explicitly map old → new step numbers so someone editing SKILL.md can apply the changes without ambiguity.
- In the structural scan section, consider clarifying how the sub-agent identifies the test framework / language for a file (e.g., by extension, heuristics, or upstream metadata) and what it should do when it cannot confidently recognize assertions or skip annotations.
- For the sub-agent interface, it might be worth specifying a more structured output format (e.g., JSON schema for classification, structural summary, and reductive findings) so that the main agent can consume the results reliably and avoid parsing free-form text.
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- The Step 12 sub-step renumbering in the SKILL.md integration section is a bit confusing (existing steps 1–7 vs. new steps 8–13); it would help to explicitly map old → new step numbers so someone editing SKILL.md can apply the changes without ambiguity.
- In the structural scan section, consider clarifying how the sub-agent identifies the test framework / language for a file (e.g., by extension, heuristics, or upstream metadata) and what it should do when it cannot confidently recognize assertions or skip annotations.
- For the sub-agent interface, it might be worth specifying a more structured output format (e.g., JSON schema for classification, structural summary, and reductive findings) so that the main agent can consume the results reliably and avoid parsing free-form text.Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Closes spike TC-4153
Test plan
🤖 Generated with Claude Code
Summary by Sourcery
Documentation: