
Conversation

Contributor

@Nospamas Nospamas commented Jan 14, 2026

Having chatted with Rod about this, we concluded that the best way to handle renaming networks while still being able to refer to them in a human-readable way is to add a new immutable, unique column, network_key, to the meta_network table. This PR makes the changes required to add that column and resolves various issues that arise from adding columns to a history-enabled PyCDS table. Main changes:

  • A migration that adds the new column to the base table
  • A migration that preserves matching column order by copying data into a new history table that includes the network_key column
  • ORM table versioning that lets us run tests against an older version of the database where the column may not exist yet
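
The first change can be sketched roughly as follows. This is a minimal illustration using SQLite as a stand-in for PostgreSQL (and raw SQL in place of the PR's Alembic migration); the table and column names follow the PR, but the data and index name are illustrative:

```python
import sqlite3

# Sketch of the base-table migration step: add a nullable network_key
# column to meta_network, backfill it from network_name, and enforce
# uniqueness with a unique index.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE meta_network (network_id INTEGER PRIMARY KEY, network_name TEXT)"
)
conn.executemany(
    "INSERT INTO meta_network (network_name) VALUES (?)",
    [("EC_raw",), ("FLNRO-WMB",)],
)

# New columns must start out nullable (or have a simple default).
conn.execute("ALTER TABLE meta_network ADD COLUMN network_key TEXT")
# Backfill from the human-readable name, then make the column unique.
conn.execute("UPDATE meta_network SET network_key = network_name")
conn.execute(
    "CREATE UNIQUE INDEX uq_meta_network_key ON meta_network (network_key)"
)

keys = [
    row[0]
    for row in conn.execute(
        "SELECT network_key FROM meta_network ORDER BY network_id"
    )
]
print(keys)  # ['EC_raw', 'FLNRO-WMB']
```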

Info on the second point:

Rod's history-tracking setup uses a set of triggers that execute any time modifications are made to the base tables. These triggers are shared by all history-tracked tables and rely on column order to insert the right information from the base table into the history table. A new column is appended at the end; we can append it to both tables, but because the history table includes two extra columns (deleted and <table>_hx_id), the columns no longer line up when the triggers run. There are three main potential fixes:

    1. Use a copy-on-write process: rename the existing history table, create a new one with the new column, copy the existing data over (including its FK references), and delete the old table. Pros: self-contained, still uses the existing history logic. Cons: doesn't really fix the problem long term, and is prone to data loss if there are logical errors in the copying code.
    2. Update the triggers so that they use column names instead of relying on order. Pros: fixes the issue permanently. Cons: triggers would have to be created per table, and the trigger-generation functions in Python would need significant rework.
    3. Rework the triggers and table column orders so that history-only columns come first. Triggers could then rely on order when new columns are added, since the extra history columns would be handled up front. Pros: fixes the problem permanently and keeps the trigger code simple. Cons: would require copying and regenerating every table in the database. That wouldn't be too bad for the metadata tables, but would take several hours for obs_raw.

We've opted for option 1 for now: it's rare to need to add new columns to an existing table, and this is the pragmatic approach to the problem.
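
The copy-on-write process can be sketched like this, again using SQLite in place of PostgreSQL; the table shape mirrors the description above (a history table with the extra <table>_hx_id and deleted columns), but all names and data are illustrative rather than the exact PyCDS identifiers:

```python
import sqlite3

# Sketch of option 1 (copy-on-write): build a replacement history table
# that includes network_key in the desired position, copy the old rows
# across by explicit column name, then drop the old table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE meta_network_hx ("
    "meta_network_hx_id INTEGER PRIMARY KEY, network_id INTEGER, "
    "network_name TEXT, deleted INTEGER)"
)
conn.execute("INSERT INTO meta_network_hx VALUES (1, 10, 'EC_raw', 0)")

# 1. Rename the old history table out of the way.
conn.execute("ALTER TABLE meta_network_hx RENAME TO meta_network_hx_old")
# 2. Create the replacement with network_key in its final position,
#    keeping the history-only columns aligned with the trigger's order.
conn.execute(
    "CREATE TABLE meta_network_hx ("
    "meta_network_hx_id INTEGER PRIMARY KEY, network_id INTEGER, "
    "network_name TEXT, network_key TEXT, deleted INTEGER)"
)
# 3. Copy the data over by explicit column name; pre-existing history
#    rows simply get NULL for network_key.
conn.execute(
    "INSERT INTO meta_network_hx "
    "(meta_network_hx_id, network_id, network_name, deleted) "
    "SELECT meta_network_hx_id, network_id, network_name, deleted "
    "FROM meta_network_hx_old"
)
# 4. Drop the old table once the copy is verified.
conn.execute("DROP TABLE meta_network_hx_old")

row = conn.execute("SELECT * FROM meta_network_hx").fetchone()
print(row)  # (1, 10, 'EC_raw', None, 0)
```

As the PR notes, the risk sits in step 3: any mismatch between the copy statement and the real column list silently loses data, which is why this only makes sense as a rarely-used escape hatch.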

Contributor

@QSparks QSparks left a comment


Looks good, just a few minor comments.
I am not familiar with the database internals, but the migration logic looks sound.

One possible edge case: network_name is nullable in the schema. If any rows have NULL network_name, they'll get NULL network_key values (which PostgreSQL's unique constraint allows). Do we ever have NULL network names in practice? If so, should we validate that all network names are populated?
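
The edge case is easy to demonstrate. In both PostgreSQL (by default) and SQLite, a UNIQUE constraint treats NULLs as distinct from each other, so multiple NULL keys can coexist; the schema here is illustrative:

```python
import sqlite3

# A UNIQUE constraint does not prevent multiple NULLs: NULL is never
# equal to NULL, so each NULL network_key passes the uniqueness check.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE meta_network ("
    "network_id INTEGER PRIMARY KEY, network_key TEXT UNIQUE)"
)
conn.execute("INSERT INTO meta_network (network_key) VALUES (NULL)")
conn.execute("INSERT INTO meta_network (network_key) VALUES (NULL)")  # allowed

n = conn.execute(
    "SELECT COUNT(*) FROM meta_network WHERE network_key IS NULL"
).fetchone()[0]
print(n)  # 2
```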

Contributor

@jameshiebert jameshiebert left a comment


Just a couple small changes to consider. I'll let you be the judge of their practicality.

)
)

# Drop existing triggers before modifying table structure so that we don't accidentally track
Contributor

Near as I can tell, you got the sequence right for dropping/recreating all of the triggers/constraints/FKs. Not sure if this is possible, but could we disable or defer the triggers so we don't have to recreate them from their definitions? I feel like it would make the code less verbose, and give us the assurance that we're not changing the definition (unless that's what we want).

Contributor Author

I'm not sure it changes verbosity much, but I've added some functions to enable and disable this trigger rather than removing it.

I'm fairly certain the code in its current state is safe, but if someone were to change the underlying functions that create and remove these triggers, there is the potential for an unexpected definition change.

sa.Column(
"network_key",
sa.String(),
nullable=True,
Contributor

To Quintin's point, network_name, though currently NULLable, shouldn't be (and in practice, never is), and I think network_key should not be nullable either. What good is a key that's NULL :)

Contributor Author

I agree, but I think I'm stuck in a catch-22 here. I can't add a new column without it being either a) nullable or b) given a default value, and default values in Postgres need to be fairly simple: they can't refer to other column data, even via functions.

To work around this, the code as currently applied creates the column as nullable, but a trigger populates it on every insert, acting as a default.
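
The trigger-as-default pattern looks roughly like this. This sketch uses a SQLite trigger in place of the PR's PostgreSQL trigger, and backfills network_key from network_name purely for illustration; the actual population logic and names in the PR may differ:

```python
import sqlite3

# network_key stays nullable in the DDL, but an AFTER INSERT trigger
# fills it in whenever a row arrives without one, acting as a default
# that can reference other column data (which a plain DEFAULT cannot).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE meta_network ("
    "network_id INTEGER PRIMARY KEY, network_name TEXT, network_key TEXT)"
)
conn.execute(
    """
    CREATE TRIGGER meta_network_key_default
    AFTER INSERT ON meta_network
    FOR EACH ROW WHEN NEW.network_key IS NULL
    BEGIN
        UPDATE meta_network
        SET network_key = NEW.network_name
        WHERE network_id = NEW.network_id;
    END
    """
)

# Insert without a key: the trigger supplies one from network_name.
conn.execute("INSERT INTO meta_network (network_name) VALUES ('EC_raw')")
key = conn.execute("SELECT network_key FROM meta_network").fetchone()[0]
print(key)  # 'EC_raw'
```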
