pip install -r requirements.txt
CUDA_VISIBLE_DEVICES="0" python train.py --model_cfg_yaml ./model_configs/Llama-2-7b-chat-hf.yaml \
--layers 16 --layer_loc residual --lr 1e-3 --batch_size 8192 --model_batch_size 8 \
--acts_cache_dir [ACTIVATION_CACHE_DIR] --model_save_dir [MODEL_CACHE_DIR]
The final model is saved in MODEL_CACHE_DIR and can be loaded with AutoEncoder.load.
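For example, loading a trained model back might look like this (a minimal sketch; the import path and the exact AutoEncoder.load signature are assumptions, check the repo code for the real API):

```python
# Minimal sketch: reload a trained autoencoder from the save directory.
# The module path and load() signature here are assumptions.
from autoencoder import AutoEncoder  # hypothetical import path

ae = AutoEncoder.load("[MODEL_CACHE_DIR]")  # same path passed as --model_save_dir
# ae can then be used to encode cached activations into sparse features
```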
The activations will be cached on disk in ACTIVATION_CACHE_DIR.
During training, activations are loaded and pinned in CPU memory.
Refreshing the buffer lazily loads activations from CPU into GPU memory.
I have found this to be roughly a 2x speedup over the non-hierarchical-cache version.
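Under the hood this is the standard PyTorch pinned-memory staging pattern; a rough sketch of the idea (illustrative only, not the repo's actual buffer code, and the shard filename is hypothetical):

```python
import torch

# Activations live on disk -> pinned CPU tensor -> GPU slices on demand.
cpu_buffer = torch.load("acts_shard_0.pt", map_location="cpu").pin_memory()

def refresh(buffer_slice: torch.Tensor, device: str = "cuda:0") -> torch.Tensor:
    # Copies from pinned memory can be non_blocking, so the host-to-device
    # transfer overlaps with compute instead of stalling the training loop.
    return buffer_slice.to(device, non_blocking=True)

gpu_batch = refresh(cpu_buffer[:8192])  # matches --batch_size above
```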
Currently, train.py only exposes a subset of the ActivationsBufferConfig and AutoEncoderConfig options, added by hand.
A better approach would be to generate these arguments automatically via argparse (a future TODO), as sketched below.
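One possible shape for that TODO, assuming the configs are dataclasses (an assumption, shown only as an illustration):

```python
import argparse
import dataclasses

def add_dataclass_args(parser: argparse.ArgumentParser, cfg_cls) -> None:
    # Hypothetical helper: create one --flag per config field so train.py
    # does not have to list ActivationsBufferConfig/AutoEncoderConfig options by hand.
    for field in dataclasses.fields(cfg_cls):
        arg_type = field.type if callable(field.type) else str
        default = None if field.default is dataclasses.MISSING else field.default
        parser.add_argument(f"--{field.name}", type=arg_type, default=default)
```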
CUDA_VISIBLE_DEVICES="0" TOKENIZERS_PARALLELISM="false" python sweep.py --model_cfg_yaml ./model_configs/Llama-2-7b-chat-hf.yaml \
--layer 16 --layer_loc residual --lr 1e-4 1e-5 --lambda_reg 1e-1 1e-2 --parallelism 4 \
--batch_size 1024 --model_batch_size 8 --acts_cache_dir [ACTIVATION_CACHE_DIR]
This command sweeps learning rates of {1e-4, 1e-5} and sparsity regularizations of {1e-1, 1e-2}.
Sweepable parameters include lr, beta1, beta2, lambda_reg, warmup_percent, act_renorm_type, and act_renorm_scale.
See details in sweep.py.
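Conceptually, the sweep expands the swept values into a grid of runs and executes up to --parallelism of them at once; roughly (illustrative only, not sweep.py's actual code):

```python
from itertools import product

lrs = [1e-4, 1e-5]
lambda_regs = [1e-1, 1e-2]

# The example command above launches one run per (lr, lambda_reg) pair,
# i.e. 2 x 2 = 4 runs, which is why --parallelism 4 covers the whole grid at once.
for lr, lam in product(lrs, lambda_regs):
    print(f"run: lr={lr}, lambda_reg={lam}")
```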