
Add orthogonal steering baseline experiment runner #6

Open
xocelyk wants to merge 3 commits into feat/caa-steering from codex/implement-orthogonal-steering-baseline-experiments


xocelyk (Owner) commented on Aug 19, 2025

Summary

  • add run_orthogonal_steering_experiments.py to evaluate random orthogonal activation perturbations as a baseline for probe steering
  • provide orthogonal steering config templates for Gemma and Qwen models
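
For reviewers unfamiliar with the baseline, here is a minimal sketch of what a "random orthogonal activation perturbation" could look like. It is an illustration under stated assumptions, not the code in run_orthogonal_steering_experiments.py: the function names `random_orthogonal_direction` and `steer`, the norm matching, and the use of plain Python lists in place of model activations are all hypothetical. The idea is to sample a random vector, project out its component along the probe direction, and rescale it to the probe's norm, so that any behavioral change can be compared against a perturbation of equal size that carries no probe signal.

```python
import math
import random


def random_orthogonal_direction(probe, rng):
    """Return a random vector orthogonal to `probe` with the same L2 norm.

    Hypothetical helper: the script in this PR may build its baseline differently.
    """
    if len(probe) < 2:
        raise ValueError("need at least 2 dimensions to find an orthogonal direction")
    probe_norm_sq = sum(p * p for p in probe)
    if probe_norm_sq == 0.0:
        raise ValueError("probe direction must be non-zero")

    # Sample an isotropic Gaussian vector of the same dimensionality.
    noise = [rng.gauss(0.0, 1.0) for _ in probe]

    # Remove the component along the probe (a single Gram-Schmidt step).
    coeff = sum(n * p for n, p in zip(noise, probe)) / probe_norm_sq
    ortho = [n - coeff * p for n, p in zip(noise, probe)]

    # Rescale so the baseline perturbation has the same magnitude as the probe.
    ortho_norm = math.sqrt(sum(o * o for o in ortho))
    if ortho_norm == 0.0:
        raise ValueError("sampled vector was parallel to the probe; resample")
    scale = math.sqrt(probe_norm_sq) / ortho_norm
    return [o * scale for o in ortho]


def steer(activation, direction, alpha):
    """Add `alpha * direction` to an activation vector (assumed additive steering)."""
    return [a + alpha * d for a, d in zip(activation, direction)]


if __name__ == "__main__":
    rng = random.Random(0)           # fixed seed for reproducibility
    probe = [1.0, 2.0, -0.5, 3.0]    # stand-in for a learned probe direction
    baseline = random_orthogonal_direction(probe, rng)
    dot = sum(b * p for b, p in zip(baseline, probe))
    print(f"dot(baseline, probe) = {dot:.2e}")
    print(steer([0.0, 0.0, 0.0, 0.0], baseline, alpha=2.0))
```

Matching the norm is a design choice in this sketch: it keeps the steering coefficient comparable between the probe direction and the random baseline.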

Testing

  • python -m py_compile run_orthogonal_steering_experiments.py
  • python run_orthogonal_steering_experiments.py --help

https://chatgpt.com/codex/tasks/task_e_68a438bdf6b4832f9f5e7846c427a2cc
