Refactor multi-segment submission

As discussed in many issues, the current format for mapping metadata entries to sequences in the multi-segmented case is suboptimal. Here we proceed as voted for in microbioinfo: https://microbial-bioinfo.slack.com/archives/CB0HYT53M/p1760961465729399

Users can add an additional column `fastaId` to the metadata TSV containing a space-separated list of all the FASTA headers that should be linked to that entry. If no value is supplied, we fall back to the `submissionId` and assume it is the same as the FASTA header ID.
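To illustrate the linking rule above, here is a minimal Python sketch. It is not the Loculus implementation: the function name `link_fasta_ids`, the example column `country`, and the sample IDs are made up for illustration. Only the `fastaId` and `submissionId` column names and the fallback behaviour come from the description above.

```python
import csv
import io


def link_fasta_ids(metadata_tsv: str) -> dict[str, list[str]]:
    """Map each submissionId to the FASTA header IDs linked to it.

    Hypothetical helper for illustration only.
    """
    reader = csv.DictReader(io.StringIO(metadata_tsv), delimiter="\t")
    links: dict[str, list[str]] = {}
    for row in reader:
        submission_id = row["submissionId"]
        # `fastaId` is optional: the column may be absent or the cell empty.
        fasta_ids = (row.get("fastaId") or "").split()
        # Fallback: assume the FASTA header ID equals the submissionId.
        links[submission_id] = fasta_ids or [submission_id]
    return links


# Made-up example data: one multi-segment entry, one entry using the fallback.
example_tsv = (
    "submissionId\tfastaId\tcountry\n"
    "sample1\tsample1_L sample1_M sample1_S\tUganda\n"
    "sample2\t\tKenya\n"
)
links = link_fasta_ids(example_tsv)
```

Here `sample1` is linked to three FASTA headers, while `sample2` has an empty `fastaId` cell and is therefore linked to a single header equal to its `submissionId`.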

Preprocessing will now assign the segment.

## Steps: 

- [x] Migrate the sequence compression format in the backend: https://github.com/loculus-project/loculus/issues/3984?issue=loculus-project%7Cloculus%7C4769. Original unaligned sequences do not have an assigned segment.
- [x] Refactor how the backend joins sequences and metadata entries. It now sends preprocessing `originalData` as a record from fastaHeader to sequence: https://github.com/loculus-project/loculus/pull/5398
- [x] Have preprocessing assign the segment using nextclade sort (config refactor) and also return a mapping from fastaHeaderId to segment: https://github.com/loculus-project/loculus/pull/4783
- [x] Update the edit page to use the fastaHeader mapping and work correctly
- [ ] Migrate older data to have a fastaHeader mapping
- [x] Before releasing: update CCHF example data
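
The data flow in the steps above can be sketched roughly as follows. This is an assumption-laden illustration, not the code from the linked PRs: `assign_segments` and `toy_classifier` are hypothetical names, the classifier is a stand-in for whatever nextclade sort returns, and raising on unassigned or duplicate segments is an assumed policy. It only shows the shape of the input (a record from FASTA header to sequence) and of the two outputs (sequences keyed by segment, plus the fastaHeaderId to segment mapping).

```python
from typing import Callable, Optional


def assign_segments(
    unaligned: dict[str, str],
    classify: Callable[[str], Optional[str]],
) -> tuple[dict[str, str], dict[str, str]]:
    """Assign a segment to each sequence keyed by FASTA header.

    `classify` is a hypothetical stand-in for the real segment detection.
    Returns (sequence per segment, segment per FASTA header).
    """
    sequence_by_segment: dict[str, str] = {}
    segment_by_header: dict[str, str] = {}
    for header, sequence in unaligned.items():
        segment = classify(sequence)
        if segment is None:
            # Assumed policy: an unclassifiable sequence is an error.
            raise ValueError(f"no segment found for {header}")
        if segment in sequence_by_segment:
            # Assumed policy: at most one sequence per segment and entry.
            raise ValueError(f"duplicate segment {segment} for {header}")
        sequence_by_segment[segment] = sequence
        segment_by_header[header] = segment
    return sequence_by_segment, segment_by_header


# Toy classifier: a fixed lookup from sequence to segment, purely illustrative.
toy_classifier = {"AAAA": "L", "CCCC": "M", "GGGG": "S"}.get

original_data = {"sample1_L": "AAAA", "sample1_S": "GGGG"}
by_segment, header_map = assign_segments(original_data, toy_classifier)
```

Keeping the header-to-segment mapping alongside the per-segment sequences is what lets a page such as the edit view show which submitted FASTA entry ended up in which segment.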

Refactor multi-segment submission #5392
