feat: Add Harmfulness EValuator by smeetd159 · Pull Request #45 · strands-agents/evals

smeetd159 · 2025-11-18T22:31:03Z

Description

Add HarmfulnessEvaluator to assess whether agent responses contain inappropriate response given the user prompt
Implement 2-level scoring system (Harmful, Not Harmful)
Add harmfulness rubric prompt template (v0) with evaluation guidelines
Include example usage and comprehensive unit tests

NA

New feature

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

feat: Add Harmfulness EValuator

4ac58c5

smeetd159 had a problem deploying to manual-approval November 18, 2025 22:31 — with GitHub Actions Failure

smeetd159 closed this Dec 2, 2025

smeetd159 deleted the feat-HarmfulnessEvaluator branch December 2, 2025 22:01