
HyperSteer

Steering models at scale with hypernetworks. This repo supports large-scale distributed training and inference, along with high-performance LLM-in-the-loop evaluation, for training your own HyperSteer models.

Setup

We use uv to streamline dependency management:

# in root directory
uv sync

To install development dependencies, including ipykernel, jupyter, pre-commit, and ai_commit, run:

uv sync --extra dev

To install all optional dependencies, including ray and flash-attn, run:

uv sync --all-extras
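
To quickly confirm that the optional packages resolved in the synced environment, a check like the following works (this snippet is not part of the repo; the package names are just the two listed above):

# Sanity-check that optional extras are importable.
import importlib.util

for pkg in ("ray", "flash_attn"):
    status = "ok" if importlib.util.find_spec(pkg) is not None else "missing"
    print(f"{pkg}: {status}")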

Usage

The Hydra config is structured hierarchically:

config
├── config.yaml
├── dataset
│   ├── axbench.yaml
│   └── base.yaml
├── evaluate
│   ├── base.yaml
│   └── hypersteer.yaml
├── experiment
│   ├── base_gh200.yaml
│   ├── base.yaml
│   └── hypersteer.yaml
├── generate
│   └── base.yaml
├── hydra
│   ├── hydra_logging
│   │   └── colorlog.yaml
│   └── job_logging
│       └── colorlog.yaml
├── inference
│   ├── base.yaml
│   └── hypersteer.yaml
├── launcher
│   ├── base.yaml
│   └── ray.yaml
├── model
│   ├── base.yaml
│   └── hypersteer.yaml
├── train
│   ├── base.yaml
│   └── hypersteer.yaml
└── wandb
    └── base.yaml

An experiment is a pre-configured set of overrides for the default configuration. For example, to use the hypersteer experiment:

python -m hypersteer.scripts.[train|inference|evaluate] experiment=hypersteer ...<hydra overrides>

By default, training runs inference and evaluation at the end; this behavior can be adjusted through the configuration.
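
For reference, each script is a standard Hydra entrypoint composed from the tree above. A minimal sketch of such an entrypoint is shown below (the exact decorator arguments are assumptions, not the repo's actual code):

import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(version_base=None, config_path="config", config_name="config")
def main(cfg: DictConfig) -> None:
    # cfg is the fully composed config: defaults from config.yaml,
    # the selected experiment's overrides, then any CLI overrides.
    print(OmegaConf.to_yaml(cfg))

if __name__ == "__main__":
    main()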

Environment Setup

Some functionalities, such as Weights & Biases logging, Hugging Face model access, and OpenAI API calls, require specific environment variables to be set. Create a .env file in the root directory of the repository and populate it with the following variables:

WANDB_PROJECT=wandb_project
WANDB_ENTITY=entity
WANDB_API_KEY=api_key
HF_TOKEN=hf_token
OPENAI_API_KEY=sk-proj-1234
LOG_LEVEL=DEBUG
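
These variables can also be exported manually or loaded programmatically; a sketch using python-dotenv (an assumption, not necessarily how this repo loads them):

import os
from dotenv import load_dotenv  # python-dotenv; assumed, not necessarily a repo dependency

load_dotenv()  # reads .env from the current working directory
print(os.environ["WANDB_PROJECT"], os.getenv("LOG_LEVEL", "INFO"))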

TODO

  • Faster initialization for big networks (pretty easy: just initialize directly on device)
  • Better data management (right now only AxBench is supported, and we use the Parquet files committed to GitHub); migrate fully to Hugging Face datasets
  • Safetensors
  • Unified/clean checkpointing and robust resume/fault tolerance in training
  • Proper implementation of data generation (TODO: jiuding)
  • Robust distributed training support via FSDP and DDP
  • Optional fast DeepSpeed inference kernels and revamped inference logic with distributed data parallel support
  • Optional Liger kernel, FA2, etc. for faster training
