fix: handle LayerNorm folding correctly in `load_and_process_state_dict` #1215
VedantMadane wants to merge 3 commits into TransformerLensOrg:dev
Conversation
Previously, calling `load_and_process_state_dict(state_dict, fold_ln=True)` had two failure modes:

1. If the state_dict had unfolded LN weights, `fold_layer_norm` removed the LN keys but the model's modules were not replaced with `LNPre`, leaving mismatched architecture and broken hooks.
2. If the state_dict was already folded (no LN keys), `fold_layer_norm` crashed with a `KeyError` trying to access missing LN weight keys.

Fix both by:

- Checking whether LN keys exist before attempting to fold (skip with a warning if already folded)
- Replacing LN/RMS modules with `LNPre`/`RMSPre` before folding, matching the logic previously only in `process_weights_`
- Calling `self.setup()` after loading to re-attach hooks
- Simplifying `process_weights_` to delegate fully to the fixed method

Fixes TransformerLensOrg#219

Signed-off-by: Vedant Madane <6527493+VedantMadane@users.noreply.github.com>
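The "check whether LN keys exist before attempting to fold" step can be sketched in isolation. This is a toy illustration over plain Python dicts with a hypothetical helper name and key names following TransformerLens conventions (`blocks.{l}.ln1.w` etc.); it is not the library's actual code.

```python
# Hypothetical sketch of the "is this state dict already folded?" check.
# Key names follow TransformerLens conventions, but this helper is
# illustrative, not library code.

def has_unfolded_ln_keys(state_dict, n_layers):
    """Return True if any LayerNorm weight keys are still present."""
    ln_keys = [f"blocks.{l}.ln{i}.w" for l in range(n_layers) for i in (1, 2)]
    ln_keys.append("ln_final.w")
    return any(k in state_dict for k in ln_keys)

# An unfolded checkpoint still carries LN weights:
unfolded = {"blocks.0.ln1.w": [1.0], "blocks.0.ln2.w": [1.0], "ln_final.w": [1.0]}
# A folded checkpoint has had them absorbed into adjacent weights:
folded = {"blocks.0.attn.W_Q": [[0.5]]}

print(has_unfolded_ln_keys(unfolded, n_layers=1))  # True
print(has_unfolded_ln_keys(folded, n_layers=1))    # False
```

A check like this lets the loading path branch between folding and skip-with-warning instead of assuming the keys are present.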
In the future, please run
For a change like this that is meant to address specific failures, it is important to add new unit tests covering these previously failing states (to prevent regression) and any new features/functionality you add (to make sure they don't break in the future).
Apologies for the formatting noise — the diff is now cleaned up to contain only the logic changes. No auto-formatter reformatting of existing lines.
Thank you! I will take a look at reviewing this soon |
Fixes #219
Problem
`load_and_process_state_dict(state_dict, fold_ln=True)` has two failure modes when called directly (outside the `from_pretrained` path):

1. **Unfolded state dict:** `fold_layer_norm` correctly removes LN keys from the state dict, but the model's `LayerNorm` modules are never replaced with `LayerNormPre`. This leaves a mismatched architecture (modules expect `w`/`b` params that no longer exist) and broken hooks after loading.
2. **Already-folded state dict:** `fold_layer_norm` crashes with a `KeyError` because it tries to access `blocks.{l}.ln1.w` keys that were already removed when the model was saved.

The workaround (from the issue) was a 3-step dance:
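The already-folded crash can be reproduced in miniature. Below is a toy stand-in (plain dicts, hypothetical names; not the library's actual `fold_layer_norm`, and the "folding math" is replaced by a key rename) showing why unconditionally popping LN keys fails on a checkpoint that was saved after folding:

```python
# Toy illustration of failure mode 2: a fold routine that unconditionally
# pops LN keys raises KeyError when the state dict is already folded.
# Names and the "folding" itself are stand-ins, not TransformerLens code.

def naive_fold(state_dict):
    # Old behavior: assumes LN keys exist and pops them unconditionally.
    w = state_dict.pop("blocks.0.ln1.w")  # KeyError if already folded
    state_dict["blocks.0.attn.W_Q_folded"] = w  # stand-in for the real math
    return state_dict

already_folded = {"blocks.0.attn.W_Q": [[0.5]]}  # no LN keys left
try:
    naive_fold(already_folded)
except KeyError as e:
    print(f"crashed with KeyError on {e}")  # the behavior reported in #219
```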
Fix
1. **Detect already-folded state dicts:** Check whether LN weight keys (`.ln1.w`, `.ln2.w`, `ln_final.w`) exist before folding. If missing, skip with a warning instead of crashing.
2. **Replace LN modules when folding:** Move the `LayerNorm` -> `LayerNormPre` (and `RMSNorm` -> `RMSNormPre`) module replacement from `process_weights_` into `load_and_process_state_dict`, so direct callers get the same correct behavior.
3. **Re-attach hooks:** Call `self.setup()` after loading when `fold_ln=True` to ensure hooks are properly connected to the new modules.
4. **Simplify `process_weights_`:** Since the module replacement now lives in `load_and_process_state_dict`, `process_weights_` can simply delegate without duplicating the logic.

What this enables