v6.20.0

github-actions released this 27 Oct 21:29 · 2 commits to main since this release

v6.20.0 (2025-10-27)

Feature

  • feat: Temporal SAE integration (#575)

  • clauded temporal SAE integration

  • setting up tests and removing intermediate files

  • use SAELens names for weights

  • updating implementation

  • disallow folding w dec norm for temporal saes

  • added temporal sae

  • Revert "added temporal sae"

    This reverts commit c644790.

  • added warning to standalone decoding

  • updated loading temporal sae to safetensors format

  • fixing syntax issues

  • testing sae inference

  • fix: 1. W_enc not initialized for tied weights and 2. added scaling factor.

  • added end-to-end comparison with original implementation of TemporalSAE

  • fixed linting

  • fix: set temporal hook_name, fix lint

  • add neuronpedia entries to yaml

  • use gemma-2-2b instead of google/gemma-2-2b in pretrained yaml

  • make W_enc optional for Temporal SAE

  • adapted tests

  • ruff formatting

  • fixed layer index of temporal Llama SAEs

  • fix: temporal pretrained yaml

  • fix: undo formatting change

  • Fix hook_resid_post ID in pretrained_saes.yaml

  • fix: final corrections for temporal SAEs llama yaml

  • moving scaling into temporal SAEs for now


Co-authored-by: David Chanin
Co-authored-by: Johnny Lin (888c586)