Conversation

@canrager
Contributor

No description provided.

@hijohnnylin
Collaborator

hijohnnylin commented Oct 24, 2025

Loading the temporal SAE safetensor with SAE.from_pretrained or SAE.from_pretrained_with_cfg_and_sparsity currently fails for two reasons:

  • W_enc doesn't exist in the state dictionary (checked with Gemma 2). I printed the state_dict_raw keys in temporal_sae_huggingface_loader in pretrained_sae_loaders.py, and I don't see "E":
    • dict_keys(['D', 'attn_layers.0.c_proj.bias', 'attn_layers.0.c_proj.weight', 'attn_layers.0.k_ctx.bias', 'attn_layers.0.k_ctx.weight', 'attn_layers.0.q_target.bias', 'attn_layers.0.q_target.weight', 'attn_layers.0.v_ctx.bias', 'attn_layers.0.v_ctx.weight', 'b'])
    • @canrager please verify and reupload the safetensor files
  • b_dec shape is slightly off: we expect [2304], but the saved b_dec is [1, 2304]
    • workaround: squeeze b_dec while importing in the sae.py converter (see the sketch below)
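
A minimal sketch of that converter workaround, assuming the raw state dict is keyed as printed above; the function name and the "E" → W_enc mapping are illustrative, not the actual SAE Lens converter code:

```python
import torch

def convert_temporal_state_dict(
    state_dict_raw: dict[str, torch.Tensor],
) -> dict[str, torch.Tensor]:
    # The encoder matrix "E" is missing from the uploaded safetensors,
    # so fail loudly instead of building a half-initialized SAE.
    if "E" not in state_dict_raw:
        raise KeyError(
            f"temporal SAE checkpoint has no encoder weight 'E'; found {sorted(state_dict_raw)}"
        )
    return {
        "W_enc": state_dict_raw["E"],
        "W_dec": state_dict_raw["D"],
        # b is saved as [1, 2304] but SAE.from_pretrained expects [2304]:
        # squeeze the leading singleton dim.
        "b_dec": state_dict_raw["b"].squeeze(0),
    }
```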

```python
normalize_activations: Literal[
    "none", "expected_average_only_in", "constant_norm_rescale", "layer_norm"
] = "none"  # none, expected_average_only_in (Anthropic April Update), constant_norm_rescale (Anthropic Feb Update)
activation_normalization_factor: float = 1
```
Collaborator


Why is this needed? I'd rather avoid adding config options to the global SAE config if they're just for temporal SAEs. If constant_norm_rescale isn't currently used, we should just delete it from the types IMO. Can you fold the scaling factor into your temporal SAE weights when you load them, so this isn't needed as a separate global SAE config option? (See the sketch below.)
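
A minimal sketch of that folding, assuming the normalization multiplies input activations by the factor and reconstructions are divided by it on the way out; the function name is hypothetical, not SAE Lens API:

```python
import torch

def fold_normalization_factor(
    state_dict: dict[str, torch.Tensor], factor: float
) -> dict[str, torch.Tensor]:
    # (factor * x) @ W_enc == x @ (factor * W_enc), so scaling the encoder
    # absorbs the input normalization at load time.
    state_dict["W_enc"] = state_dict["W_enc"] * factor
    # Dividing the decoder side by the factor maps reconstructions back to
    # the original activation scale. It also keeps an optional b_dec input
    # subtraction consistent, since
    # (factor * x - b_dec) @ W_enc == (x - b_dec / factor) @ (factor * W_enc).
    state_dict["W_dec"] = state_dict["W_dec"] / factor
    state_dict["b_dec"] = state_dict["b_dec"] / factor
    return state_dict
```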

@chanind merged commit 888c586 into decoderesearch:main on Oct 27, 2025
3 checks passed
