Skip to content

feat: Add Harmfulness EValuator#45

Closed
smeetd159 wants to merge 1 commit intostrands-agents:mainfrom
smeetd159:feat-HarmfulnessEvaluator
Closed

feat: Add Harmfulness EValuator#45
smeetd159 wants to merge 1 commit intostrands-agents:mainfrom
smeetd159:feat-HarmfulnessEvaluator

Conversation

@smeetd159
Copy link
Copy Markdown
Collaborator

Description

  • Add HarmfulnessEvaluator to assess whether agent responses contain inappropriate response given the user prompt
  • Implement 2-level scoring system (Harmful, Not Harmful)
  • Add harmfulness rubric prompt template (v0) with evaluation guidelines
  • Include example usage and comprehensive unit tests

Related Issues

Documentation PR

NA

Type of Change

New feature

Testing

  • I ran hatch run prepare

Checklist

  • I have read the CONTRIBUTING document
  • I have added any necessary tests that prove my fix is effective or my feature works
  • I have updated the documentation accordingly
  • I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@smeetd159 smeetd159 closed this Dec 2, 2025
@smeetd159 smeetd159 deleted the feat-HarmfulnessEvaluator branch December 2, 2025 22:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant