Design

PostgreSQL migration management tool.

I like to lean heavily on the database, and therefore don't like to use tools that get in the way of that. There are plenty of other migration solutions out there, but this one solves problems that I've faced. A tool like sqitch was close to my needs, but didn't handle testing and reusing of components in the way I would ultimately like. Also, because of its dependence on PostgreSQL's variables, it prevented me from using variables in the way that I have been exploring for multi-tenant databases.

The main goal of Spawn is to make it easy to have reusable components, and keep track of changes as you modify your components over time, while sacrificing as little raw power of the database as possible. I'm not interested in abstractions, but rather tools that make it easier to work with the full breadth of features offered by modern databases.

Design goals:

- When updating functions or (some) views, you edit the component's file in the components folder in place and reference that component in your new migration script. This ensures that git diffs show what has changed in the function, rather than showing a fresh copy.
- A record of the components used in a migration script is kept as they were at that time. This helps if you need to return to an earlier migration script and update it: the version of the component from that time can be used instead of the current version.
- Support for variables, so that things such as schema names are configurable (e.g., generating migrations for multiple tenancies in a schema-per-tenant setup).
- Support for plain PostgreSQL SQL, so that we aren't locked out of any database features.
- Track migrations that have been applied to the database in a table.
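
As a rough illustration of the variables goal, the sketch below renders one plain-SQL script per tenant schema. It is not Spawn's implementation: Python's string.Template stands in for minijinja, and the template text, the `schema` variable, and the tenant names are all hypothetical.

```python
from string import Template

# Hypothetical migration template. Spawn uses minijinja templates;
# string.Template is only a stand-in for this sketch.
MIGRATION_TEMPLATE = Template(
    "CREATE SCHEMA IF NOT EXISTS $schema;\n"
    "CREATE TABLE $schema.roles (id bigint PRIMARY KEY, name text NOT NULL);\n"
)


def render_for_tenants(template: Template, tenants: list[str]) -> dict[str, str]:
    """Render one plain-SQL script per tenant schema name."""
    return {tenant: template.substitute(schema=tenant) for tenant in tenants}


# Hypothetical tenant schema names.
scripts = render_for_tenants(MIGRATION_TEMPLATE, ["tenant_a", "tenant_b"])
```

Each entry in `scripts` is ordinary SQL with the schema name filled in, so nothing about the database's own feature set is hidden behind the templating step.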

For now, the focus is on PostgreSQL.

Design

In your root project you will need a spawn.toml configuration file. It will point to a spawn_folder which is where your migrations, components, and tests go. In that folder we have:

- /components. This contains standalone SQL snippets that can be modified and reused in migrations and tests. These are minijinja templates, which may contain just pure SQL. The goal is to have proper change tracking for these, so that we can look at the git history for a file and see how it has changed over time.
- /migrations. This folder contains your migrations, one folder for each. E.g., 20240802030220-support-roles/up.sql. These are minijinja templates, designed to produce plain SQL migration scripts. In these templates, you can import components.
- /tests. This contains all your tests, and works in a similar way to migrations, but with some helpful commands to work with them.
- /pinned. This folder contains copies of files as they were at the time a migration was pinned, stored by hash. It works in a similar way to git, and it is intended that you commit this folder to your repo. This allows migrations to be rerun/recreated as they were at that time, even if a referenced component has changed. See Pinning below for more details.

Commands

Some commands available:

- spawn init
- spawn migration new <name> creates a new migration with a timestamp and the name.
- spawn migration pin <migration> pins the migration with the current components, creating objects in the /pinned folder and a lock.toml in the migration folder that points to the correct pinned files.
- spawn migration build <migration> --pinned builds the migration into the needed SQL. --pinned is optional, and is included when you want to use the pinned files for a reproducible migration.
- spawn test run <test> uses psql (as configured by the psql_command value in spawn.toml) to pipe the generated test to your database. See Testing below for more details.

Flags

--database allows you to specify which specific database from the config to use. Defaults to default_database in your config.

Examples

Printing out SQL:

spawn migration build 20240907212659-initial static/example.json

Multiple package migrations design

Provide a standardised way for a package to expose its db migrations. Then, when another package with a binary imports it, it ensures there is a way to run the binary so that it outputs the migrations in the standardised way.

The spawn tool can then be configured to call that binary too, and operate in all the usual ways for things that make sense (e.g., can't pin?). So you can see which migrations are unapplied, and which have been applied, etc.

- The framework has a public function that returns its embedded migrations.
- A project that imports the framework has a subcommand that, when invoked, outputs the migrations to the terminal in an expected format.
- The migrator can be told this command, and will then allow you to run the normal migration commands, treating the output from this binary as a pseudo file system rather than an actual folder.
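
To make the "pseudo file system" idea concrete, here is a minimal sketch. The output format is not decided in this design, so the sketch assumes (purely for illustration) that the binary prints a JSON object mapping file paths to file contents. The function name and sample data are hypothetical.

```python
import json


def parse_migration_output(stdout: str) -> dict[str, str]:
    """Treat a binary's stdout as an in-memory path -> contents mapping.

    Assumes a JSON object keyed by path; the real format is undecided.
    """
    files = json.loads(stdout)
    if not isinstance(files, dict):
        raise ValueError("expected a JSON object of path -> contents")
    return files


# Stand-in for what a project's migrations subcommand might print.
sample_stdout = json.dumps(
    {"20240802030220-support-roles/up.sql": "CREATE ROLE app;"}
)

pseudo_fs = parse_migration_output(sample_stdout)
# Migration names are the top-level folder of each path.
migration_names = sorted({path.split("/")[0] for path in pseudo_fs})
```

With a mapping like this in hand, the migrator could list or build upstream migrations the same way it does for files read from a real folder.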

Important: Make sure this supports variables, and custom passing in of schema, etc.

TODO: think about how to handle ordering. Scenario: someone uses another package with its own migrations, at version 2.3.0. They then write their own migrations for a while on their own schema, some of which interact with tables from the other package. Later, they update the upstream package from 2.3.0 to 3.1.4, which brings a number of new migrations that need to be applied. Those migrations should be considered to take place after any local migrations that have been applied to date. How do we handle this? E.g., do we keep a local file in the repo that specifies the order in which to apply upstream package migrations? Maybe when you run it, it can check the existing migrations and specify that the new ones be done at that point.
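
One possible reading of the "local file that specifies order" idea, sketched under assumptions: the repo keeps a recorded list of migrations in application order, and any newly available migration is appended after everything already recorded. This is only an exploration of the TODO, not a settled design, and the migration names are hypothetical.

```python
def extend_order(recorded: list[str], available: list[str]) -> list[str]:
    """Append newly available migrations after everything already recorded.

    Existing positions are never rewritten, so upstream migrations that
    arrive with an upgrade land after the local ones done to date.
    """
    seen = set(recorded)
    return recorded + [name for name in available if name not in seen]


# Order recorded while the upstream package was at 2.3.0 (hypothetical names).
recorded = [
    "upstream/001-init",
    "upstream/002-roles",
    "local/20240907212659-initial",
    "local/20240910101500-link-to-roles",
]

# After upgrading upstream to 3.1.4, two more upstream migrations exist.
available = recorded[:2] + ["upstream/003-audit", "upstream/004-indexes"] + recorded[2:]

new_order = extend_order(recorded, available)
```

Here the two new upstream migrations end up after the local ones, which matches the scenario described above.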

Pinning

To ensure that earlier migrations can be run with the same source components, we support pinning. Pinning takes a snapshot of the components folder as it is at that moment, and stores a reference to that snapshot in the migration folder. From then on, the migration can be run using the pinned version.

Under the hood, this works in a similar way to git, storing copies of files in the /pinned subfolder, with filenames matching their hash. The list of files and their hashes is stored as tree objects in the same /pinned folder, and the migration's config points to the root tree for its snapshot.
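
The snapshot mechanism described above can be sketched as a small content-addressed store. This is an illustration only: SHA-256, a single flat JSON tree (rather than nested tree objects), and the function names are all assumptions, not Spawn's actual on-disk format.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def store_object(pinned: Path, data: bytes) -> str:
    """Write data to pinned/<hash> and return the hash."""
    digest = hashlib.sha256(data).hexdigest()  # hash choice is an assumption
    pinned.mkdir(parents=True, exist_ok=True)
    (pinned / digest).write_bytes(data)
    return digest


def pin_components(components: Path, pinned: Path) -> str:
    """Snapshot every file under components and return the root tree hash."""
    entries = {
        path.relative_to(components).as_posix(): store_object(pinned, path.read_bytes())
        for path in sorted(components.rglob("*"))
        if path.is_file()
    }
    # The tree (name -> hash) is itself stored as an object.
    return store_object(pinned, json.dumps(entries, sort_keys=True).encode())


def read_pinned(pinned: Path, root: str, name: str) -> bytes:
    """Look a component up through the tree, ignoring the live folder."""
    tree = json.loads((pinned / root).read_bytes())
    return (pinned / tree[name]).read_bytes()


with tempfile.TemporaryDirectory() as tmp:
    base = Path(tmp)
    components = base / "components"
    components.mkdir()
    (components / "get_role.sql").write_text("SELECT 1;")
    root = pin_components(components, base / "pinned")
    (components / "get_role.sql").write_text("SELECT 2;")  # later edit
    pinned_sql = read_pinned(base / "pinned", root, "get_role.sql")
```

The migration only needs to remember `root`; in Spawn that reference lives in the migration folder's lock.toml. Editing the component afterwards does not affect what the pinned lookup returns.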

It is intended that you commit the /pinned folder to your repository.

To pin:

spawn migration pin <migration name>

Testing

For now, we will support only postgres for testing. Testing will require psql to be available.

- Allow configuration of how to invoke psql.
- Provide scaffolding for automatic use of CREATE DATABASE with a template.
  - Configurable whether failed or successful tests tear down the test database or not.
- Follow the postgres testing style of producing an expected output, and then comparing future runs to that expected output with a diff.
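
The expected-output comparison can be sketched with a unified diff. This is a minimal illustration using Python's difflib; the function name and the sample psql-style output are hypothetical, and how Spawn actually stores or compares expected output is not specified here.

```python
import difflib


def compare_output(expected: str, actual: str) -> list[str]:
    """Return a unified diff; an empty list means the outputs match."""
    return list(
        difflib.unified_diff(
            expected.splitlines(),
            actual.splitlines(),
            fromfile="expected",
            tofile="actual",
            lineterm="",
        )
    )


# Hypothetical psql-style output recorded from an earlier run.
expected = " count \n-------\n     2\n(1 row)"

passing = compare_output(expected, expected)
failing = compare_output(expected, expected.replace("2", "3"))
```

A test passes when the diff is empty; otherwise the diff lines show exactly which rows of output changed.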

Stores

There are two different goals we want to achieve:

- Allow files to come from a variety of sources (filesystem, embedded in binary, remote server).
- Allow us to use either pinned components or current components.

Ideally then we can choose any combination of these two things. Our pin choice (none, spawn, git) should be a separate choice from our files/directories fetching method (local, embedded, remote).
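
One way to keep those two choices independent is to model them as separate interfaces and compose them. The sketch below is an assumption about how that could look, not the actual store design: all class names are hypothetical, and an in-memory source stands in for filesystem, embedded, or remote sources.

```python
from dataclasses import dataclass
from typing import Protocol


class FileSource(Protocol):
    """Where bytes come from (filesystem, embedded, remote)."""

    def read(self, key: str) -> bytes: ...


class Resolver(Protocol):
    """Which version of a component to use (current or pinned)."""

    def key_for(self, name: str) -> str: ...


@dataclass
class InMemorySource:
    files: dict[str, bytes]

    def read(self, key: str) -> bytes:
        return self.files[key]


class CurrentResolver:
    def key_for(self, name: str) -> str:
        return f"components/{name}"


@dataclass
class PinnedResolver:
    tree: dict[str, str]  # component name -> content hash

    def key_for(self, name: str) -> str:
        return f"pinned/{self.tree[name]}"


@dataclass
class Store:
    source: FileSource
    resolver: Resolver

    def load(self, name: str) -> bytes:
        return self.source.read(self.resolver.key_for(name))


source = InMemorySource({"components/a.sql": b"v2", "pinned/abc123": b"v1"})
current = Store(source, CurrentResolver()).load("a.sql")
pinned = Store(source, PinnedResolver({"a.sql": "abc123"})).load("a.sql")
```

Because `Store` only depends on the two protocols, any file source can be paired with any pinning choice without either knowing about the other.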

Features and plans

Here are some of my design goals with spawn:

Local commands

Handy commands for when running locally for testing:

cargo run migration build 20240907212659-initial

# Install into ~/.cargo/bin/spawn
cargo install --path .
