[BREAKING] FEAT: Ensemble scoring for Crescendo #905

martinpollack · 2025-04-30T16:47:21Z

Description

This change creates a full pipeline for performing ensemble scoring with crescendo. Included are two new scorers: EnsembleScorer which is the driver of this change and allows results of many scorers to be aggregated, as well as SubstringsMultipleScorer which extends SubstringScorer to allow multiple strings to be searched for in a response. In addition, the crescendo orchestrator has been updated to abstract out the logic for creating the objective scorer. This is now created outside of the orchestrator in a new notebook which has been created as a template to demonstrate the capabilities of a crescendo ensemble orchestrator.

Received support from @eugeniavkim @jbolor21.

This change is breaking because it changes how a CrescendoOrchestrator object is instantiated. Instead of providing a PromptChatTarget as a scoring target for the scorer, the user needs to create a Scorer object outside of the CrescendoOrchestrator and then pass it to objective_float_scale_scorer to be used for scoring. This just abstracts the objective scorer outside of the Orchestrator object and allows for more flexibility.

Tests and Documentation

Still in pogress

pyrit/score/substrings_multiple_scorer.py

pyrit/score/ensemble_scorer.py

pyrit/score/substrings_multiple_scorer.py

martinpollack · 2025-05-31T16:53:10Z

@microsoft-github-policy-service agree

Martin Pollack added 5 commits April 22, 2025 14:55

create ensemble scorer/orchestrator classes

6bfcadd

create POC example for ensemble orchestrator

068083f

new substring scorer to search for multiple substrings

8a7ea9b

abstract objective scorer out of orchestrator, create weight step

c15de4f

replace crescendo orchestrator with ensemble variant

9cb69c2

romanlutz reviewed Apr 30, 2025

View reviewed changes

pyrit/score/substrings_multiple_scorer.py Outdated Show resolved Hide resolved

pyrit/score/ensemble_scorer.py Show resolved Hide resolved

bashirpartovi reviewed May 2, 2025

View reviewed changes

Martin Pollack added 4 commits May 12, 2025 18:46

improve typing, add clarity

3c80130

remove SubStringsMultipleScorer

ad23794

do not provide default ground truth scorer for ensemble scorer

8c846e2

add unit tests

e500c78

romanlutz assigned eugeniavkim and jbolor21 May 30, 2025

martinpollack changed the title ~~[DRAFT] [BREAKING] FEAT: Ensemble scoring for Crescendo~~ [BREAKING] FEAT: Ensemble scoring for Crescendo Jun 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BREAKING] FEAT: Ensemble scoring for Crescendo #905

[BREAKING] FEAT: Ensemble scoring for Crescendo #905

Uh oh!

martinpollack commented Apr 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

martinpollack commented May 31, 2025

Uh oh!

Uh oh!

[BREAKING] FEAT: Ensemble scoring for Crescendo #905

Are you sure you want to change the base?

[BREAKING] FEAT: Ensemble scoring for Crescendo #905

Uh oh!

Conversation

martinpollack commented Apr 30, 2025

Description

Tests and Documentation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

martinpollack commented May 31, 2025

Uh oh!

Uh oh!