SPD - Stochastic Parameter Decomposition

Note: The spd-paper branch contains code used in the paper Stochastic Parameter Decomposition. The main branch contains active work from Goodfire and collaborators since this paper's release. This is now an open source research project. Please feel free to view the issues (or add to them) and make a PR!

Weights and Bias report accompanying the paper.

Installation

From the root of the repository, run one of

make install-dev  # To install the package, dev requirements, pre-commit hooks, and create user files
make install  # To install the package (runs `pip install -e .`) and create user files

Usage

Place your wandb information in a .env file. See .env.example for an example.

The repository consists of several experiments, each of which containing scripts to run SPD, analyse results, and optionally a train a target model:

spd/experiments/tms - Toy model of superposition
spd/experiments/resid_mlp - Toy model of compressed computation and toy model of distributed representations
spd/experiments/ih - Small model trained on a toy induction head task.
spd/experiments/lm - Language model loaded from huggingface.
There is also a toy MLP memorization experiment at feature/memorization-experiments which has not been merged to main (much less effort has gone in to validating it).

Note that the lm experiment allows for running SPD on any model from huggingface or that is accessible to the environment, provided that you only need to decompose nn.Linear, nn.Embedding or transformers.modeling_utils.Conv1D layers (other layer types are not yet supported, though these should cover most modules). See spd/experiments/lm/ss_gpt2_config.yaml for an example which loads from huggingface and spd/experiments/lm/ss_gpt2_simple_config.yaml for an example which loads from https://github.com/goodfire-ai/simple_stories_train (with the model weights saved on wandb).

CLI Commands

The following CLI commands are available after installation:

`spd-local` - Local Execution

Run a single SPD experiment locally without SLURM:

spd-local tms_5-2           # Run on single GPU (default)
spd-local tms_5-2 --cpu     # Run on CPU
spd-local tms_5-2 --dp 4    # Run on 4 GPUs (single node DDP)

`spd-run` - SLURM Cluster Execution

Run experiments on a SLURM cluster with git snapshots and W&B integration:

spd-run --experiments tms_5-2                    # Run a specific experiment
spd-run --experiments tms_5-2,resid_mlp1         # Run multiple experiments
spd-run                                          # Run all experiments
spd-run --experiments tms_5-2 --cpu              # Run on CPU
spd-run --experiments ss_llama_simple --dp 4     # Data parallelism (4 GPUs)

For running hyperparameter sweeps:

spd-run --experiments <experiment_name> --sweep --n_agents <n-agents> [--cpu]

Sweep parameters:

Default sweep parameters are loaded from spd/scripts/sweep_params.yaml
You can specify a custom sweep parameters file by passing its path to --sweep

Sweep parameters support both experiment-specific and global configurations:

# Params used for all experiments
global:
  seed:
    values: [0, 1]
  loss_metric_configs:
    - classname: "ImportanceMinimalityLoss"
      coeff:
        values: [0.1, 0.2]

# Experiment-specific parameters (override global)
tms_5-2:
  seed:
    values: [100, 200]  # Overrides global seed
  task_config:
    feature_probability:
      values: [0.05, 0.1]

Post-Processing Pipeline

After training an SPD model, you can run several post-processing steps which create artifacts that can be viewed in the app.

Note, the default batch size for harvest and dataset attributions is 256, which works well for models like ss_llama_simple_mlp-2L-wide. This is configurable with --batch_size. You should also configure --n_gpus based on how many gpus you have available. You can use any number you'd like.

1. Harvest Component Statistics

First, collect component statistics (activation examples, correlations, token stats):

spd-harvest goodfire/spd/runs/abc123 --n_gpus 24

2. Automated Interpretation

After harvesting, generate LLM interpretations of components:

spd-autointerp goodfire/spd/runs/abc123

Requires OPENROUTER_API_KEY env var.

3. Dataset Attributions

Compute component-to-component attribution strengths:

spd-attributions goodfire/spd/runs/abc123 --n_gpus 24

Can be run independently of harvest/autointerp.

4. Clustering (TODO)

Component clustering analysis is under development.

Direct Script Execution

SPD can also be run by executing any of the *_decomposition.py scripts defined in the experiment subdirectories, along with a corresponding config file:

python spd/experiments/tms/tms_decomposition.py spd/experiments/tms/tms_5-2_config.yaml

Model Comparison

Use the model comparison script to analyse (post hoc) the geometric similarities between subcomponents of two different models:

python spd/scripts/compare_models/compare_models.py spd/scripts/compare_models/compare_models_config.yaml

See spd/scripts/compare_models/README.md for detailed usage instructions.

Development

Suggested extensions and settings for VSCode/Cursor are provided in .vscode/. To use the suggested settings, copy .vscode/settings-example.json to .vscode/settings.json.

App

This project contains an app for visualising and interpreting SPD decompositions. See the README.

Contributing

Please read our CONTRIBUTING.md file for guidelines on how to contribute to this project, including pull request requirements and review processes.

Development Commands

There are various make commands that may be helpful.

make check  # Run pre-commit on all files (i.e. basedpyright, ruff linter, and ruff formatter)
make type  # Run basedpyright on all files
make format  # Run ruff linter and formatter on all files
make test  # Run tests that aren't marked `slow`
make test-all  # Run all tests

Name		Name	Last commit message	Last commit date
Latest commit History 303 Commits
.cursor		.cursor
.github		.github
.vscode		.vscode
papers		papers
scripts		scripts
spd		spd
tests		tests
typings/transformers		typings/transformers
.env.example		.env.example
.gitignore		.gitignore
.mcp.json		.mcp.json
.pre-commit-config.yaml		.pre-commit-config.yaml
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
conftest.py		conftest.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SPD - Stochastic Parameter Decomposition

Installation

Usage

CLI Commands

`spd-local` - Local Execution

`spd-run` - SLURM Cluster Execution

Post-Processing Pipeline

1. Harvest Component Statistics

2. Automated Interpretation

3. Dataset Attributions

4. Clustering (TODO)

Direct Script Execution

Model Comparison

Development

App

Contributing

Development Commands

About

Uh oh!

Releases 1

Packages

Contributors 16

Uh oh!

Languages

License

goodfire-ai/spd

Folders and files

Latest commit

History

Repository files navigation

SPD - Stochastic Parameter Decomposition

Installation

Usage

CLI Commands

spd-local - Local Execution

spd-run - SLURM Cluster Execution

Post-Processing Pipeline

1. Harvest Component Statistics

2. Automated Interpretation

3. Dataset Attributions

4. Clustering (TODO)

Direct Script Execution

Model Comparison

Development

App

Contributing

Development Commands

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 16

Uh oh!

Languages

`spd-local` - Local Execution

`spd-run` - SLURM Cluster Execution

Packages