-
Notifications
You must be signed in to change notification settings - Fork 150
feat: Add BEAST framework for AI-resistant behavioral testing #51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
michaeljabbour
wants to merge
11
commits into
microsoft:main
Choose a base branch
from
michaeljabbour:feature/beast-framework
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
feat: Add BEAST framework for AI-resistant behavioral testing #51
michaeljabbour
wants to merge
11
commits into
microsoft:main
from
michaeljabbour:feature/beast-framework
+26,625
−18
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
🌍 Enable Amplifier's powerful AI agents and tools on any codebase, anywhere This major enhancement allows developers to harness Amplifier's 20+ specialized agents (zen-architect, bug-hunter, security-guardian, etc.) on any project without copying files or modifying existing repositories. ✨ New Features: - Global 'amplifier' command for system-wide access - Smart auto-detection of Amplifier installation location - Enhanced startup scripts with comprehensive error handling - Seamless integration with existing Claude workflows - Cross-platform compatibility (macOS, Linux, WSL) 🚀 Usage: make install-global # Install global command amplifier ~/my-project # Use Amplifier on any project amplifier --help # Show usage examples 📈 Benefits: - All 20+ specialized agents available on any codebase - Shared knowledge base across all projects - Same powerful automation and quality tools - Project isolation - changes only affect target project - No need to modify or copy files to existing projects 🔧 Implementation: - Enhanced amplifier-anywhere.sh with robust error handling - New bin/amplifier wrapper for global installation - Updated Makefile with install-global targets - Comprehensive documentation in README - Fixed Claude settings path resolution This democratizes access to Amplifier's AI development superpowers, making every codebase instantly compatible with the full Amplifier toolkit.
- Fix handling of Claude flags when no directory specified - Ensure --version flag works correctly without triggering full startup - Improve argument parsing logic to handle edge cases - Maintain backward compatibility with all usage patterns Tested scenarios: ✅ amplifier --version (shows version only) ✅ amplifier --print 'command' (uses current dir + Claude args) ✅ amplifier /path/to/project --model sonnet (explicit dir + args) ✅ amplifier /nonexistent/path (proper error handling) ✅ amplifier --help (shows help text)
- Modify .gitignore to permit bin/amplifier global command - Maintain exclusion of other build artifacts - Enable proper version control of global installation script
- Modified bin/amplifier to capture and pass the original PWD - Updated amplifier-anywhere.sh to use ORIGINAL_PWD when available - Fixes issue where 'amplifier' from any directory would default to amplifier repo instead of current dir
- Create amplifier.claude module for Claude Code integrations - Implement SessionAwareness for tracking multiple concurrent sessions - Add CLI commands: status, track, broadcast, activity - Include comprehensive test suite with 13 passing tests - Store session data in .data/session_awareness/ - Auto-cleanup stale sessions after 5 minutes - Support activity logging with automatic trimming - Follow Amplifier's ruthless simplicity philosophy - File-based JSON storage, no database complexity - Fail silently to never disrupt workflows 🤖 Generated with Claude Code Co-Authored-By: Claude <[email protected]>
- Add principles loader, searcher, synthesizer, and knowledge extractor - Extract 454 concepts, 8 patterns, and 8 insights from 11 principles - Build knowledge graph with 493 nodes and 814 edges - Add CLI commands for principles and knowledge management - Create persistent knowledge storage in amplifier/data/knowledge - Add comprehensive documentation and tests - Enable context-aware recommendations and task synthesis This integration provides intelligent access to AI-First Principles knowledge through both CLI and Python APIs, helping guide development decisions with extracted concepts, patterns, and insights. 🤖 Generated with Claude Code Co-Authored-By: Claude <[email protected]>
- Introduce BEAST (Behavioral Execution and Actual System Testing) framework - Add 'amplifier beast' CLI commands for running behavioral contracts - Add 'amplifier heal' command for auto-healing Python code issues - Include 14+ built-in contracts for verifying actual system behavior - Implement execution tracing for unfakeable test verification - Add comprehensive documentation and examples - Support continuous validation and mutation testing BEAST ensures code actually works in practice, not just in theory, making it invaluable for AI-assisted development where generated code must be verified.
pkit
reviewed
Oct 14, 2025
| test_data = context["test_data"] | ||
|
|
||
| # Save data | ||
| with open(memory_file, "w") as f: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's actually tested here? That open works in python?
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
This PR introduces the BEAST (Behavioral Execution and Actual System Testing) framework to Amplifier, providing AI-resistant behavioral testing that verifies actual runtime behavior rather than superficial test coverage.
What is BEAST?
BEAST is a testing framework designed to verify that software features actually work in practice, not just in theory. Unlike traditional unit tests that can be easily fooled by mocks or stubs, BEAST focuses on real execution and observable behavior through execution tracing.
Key Features
amplifier beastcommands for running and managing contractsamplifier healcommand for fixing common Python code issuesamplifier beast watchamplifier beast mutateChanges
New Commands
amplifier beast run- Run all or specific behavioral contractsamplifier beast list- List available contractsamplifier beast watch- Continuous monitoring modeamplifier beast mutate- Mutation testingamplifier heal- Auto-heal Python code issuesFiles Added
amplifier/beast/- Core BEAST framework implementationamplifier/cli/commands/beast.py- CLI command implementationamplifier/cli/commands/heal.py- Auto-healing commandamplifier/beast/README.md- Comprehensive documentationBuilt-in Contracts
Testing
All BEAST contracts pass with 100% success rate:
Security Review
Why This Matters
In the era of AI-generated code, we need testing frameworks that verify code actually works, not just that it looks correct. BEAST provides confidence that whether code is written by humans or AI, it delivers on its promises through real, verifiable behavior.
Documentation
Comprehensive documentation is included in
amplifier/beast/README.mdcovering:🤖 Generated with Claude Code