Skip to content

Conversation

@BentleyRolling
Copy link

Summary

Adds an example notebook demonstrating cross-SAE feature alignment using the external FeatureMatch package (MIT, Course Correct Labs). Provides cosine similarity heatmap, top-k matches, and quick stats.

Why

Researchers often need to compare whether two SAEs learned similar dictionaries across seeds/hparams. This example standardizes a quick, reproducible check.

How to use

# !pip install "git+https://github.com/Course-Correct-Labs/featurematch.git"
from featurematch.featurematch import align_features
res = align_features(Z_a, Z_b, topk=5, threshold=0.8)
print(res.stats)

Notes

  • Default cells run on synthetic data (works in CI/docs).
  • A commented section shows how to collect real codes from SAELens SAEs (same hook/layer + dataset).
  • Happy to upstream a minimal internal copy later if maintainers prefer tighter integration.

Checklist

  • Example runs end-to-end on CPU
  • No external keys or data required
  • Clear interpretation guidance included

- Add tutorial notebook showing cross-SAE feature alignment
- Runs on synthetic data by default (works in CI/docs builds)
- Includes commented section for real SAE code collection
- Add to Community Tutorials section in docs/index.md
- External package: FeatureMatch (MIT, Course Correct Labs)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant