You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1. Cache activations for the first 10 million tokens of EleutherAI/rpj-v2-sample.
23
23
2. Generate explanations for the first 100 features of layer 5 using the specified explainer model.
24
24
3. Score the explanations uses fuzzing and detection scorers.
25
-
4. Log summary metrics including per-scorer F1 scores and confusion matrices.
25
+
4. Log summary metrics including per-scorer F1 scores and confusion matrices, and produce histograms of the scorer classification accuracies.
26
26
27
27
The pipeline is highly configurable and can also be called programmatically (see the [end-to-end test](https://github.com/EleutherAI/delphi/blob/main/delphi/tests/e2e.py) for an example).
0 commit comments