You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Overview
Given the changes to using the Config Manager mean that we are generally appending data to the bottom, we should create a new Github Action to sort this data every time we commit to main. This will additionally help with debugging in Github as it will be easier to visually find data.
Each collection/pipeline file will need a bespoke ordering. These are listed below (or will be). We should also ensure that the CSVs for each file have the same header order (i.e. the column CSV for Dataset A doesn't have a different order than the column CSV for Dataset B). We may need to check with @pooleycodes before changing the order of these as it may impact the generation of new lines via the Add Data process. The default ordering for new collections is set in create_collection.py, but we'd have to manually change any existing files.
collection/endpoint.csv - Sort by entry-date (asc), then endpoint (asc)
collection/old-resource.csv - No sorting necessary
collection/source.csv - Sort by entry-date (asc), then endpoint (asc)
pipeline/column.csv - Sort by dataset, then endpoint, then resource, then field
pipeline/combine.csv - Sort by dataset, then endpoint, then field
pipeline/concat.csv - Sort by dataset, then endpoint, then resource, then field
Overview
Given the changes to using the Config Manager mean that we are generally appending data to the bottom, we should create a new Github Action to sort this data every time we commit to main. This will additionally help with debugging in Github as it will be easier to visually find data.
Each collection/pipeline file will need a bespoke ordering. These are listed below (or will be). We should also ensure that the CSVs for each file have the same header order (i.e. the column CSV for Dataset A doesn't have a different order than the column CSV for Dataset B). We may need to check with @pooleycodes before changing the order of these as it may impact the generation of new lines via the Add Data process. The default ordering for new collections is set in create_collection.py, but we'd have to manually change any existing files.