Reorder CSVs prior to S3 upload

**Overview**
Given the changes to using the Config Manager mean that we are generally appending data to the bottom, we should create a new Github Action to sort this data every time we commit to main. This will additionally help with debugging in Github as it will be easier to visually find data.

Each collection/pipeline file will need a bespoke ordering. These are listed below (or will be). We should also ensure that the CSVs for each file have the same header order (i.e. the column CSV for Dataset A doesn't have a different order than the column CSV for Dataset B). We may need to check with @pooleycodes before changing the order of these as it may impact the generation of new lines via the Add Data process. The default ordering for new collections is set in [create_collection.py](https://github.com/digital-land/config/blob/main/create_collection.py), but we'd have to manually change any existing files.

- [x] collection/endpoint.csv - Sort by entry-date (asc), then endpoint (asc)
- [ ] collection/old-resource.csv - No sorting necessary
- [x] collection/source.csv - Sort by entry-date (asc), then endpoint (asc)
- [x] pipeline/column.csv - Sort by dataset, then endpoint, then resource, then field
- [x] pipeline/combine.csv - Sort by dataset, then endpoint, then field
- [x] pipeline/concat.csv - Sort by dataset, then endpoint, then resource, then field
- [ ] pipeline/convert.csv - No sorting necessary - may be deleted following #2326 
- [x] pipeline/default-value.csv - Sort by dataset, then field
- [x] pipeline/default.csv - Sort by dataset, then field, then default-field
- [x] pipeline/entity-organisation.csv - Sort by dataset, then organisation, then entity-minimum
- [x] pipeline/expect.csv - Sort by dataset, then operation, then organisations
- [x] pipeline/filter.csv - Sort by dataset, then endpoint, then field
- [x] pipeline/lookup.csv - Sort by prefix then entity
- [x] pipeline/old-entity.csv - Sort by old-entity
- [x] pipeline/patch.csv - Sort by dataset, then endpoint, then field
- [x] pipeline/skip.csv - Sort by dataset, then endpoint, then pattern
- [x] pipeline/transform.csv - Sort by dataset, then replacement-field

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reorder CSVs prior to S3 upload #2131

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Reorder CSVs prior to S3 upload #2131

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions