feat(preprocessing): preserve attached files through nextclade pipeline#6008

Open
theosanderson-agent wants to merge 1 commit into main from nextclade-file-preservation
theosanderson-agent (Collaborator) commented Feb 19, 2026

Summary

Extracted from #5942 per @chaoran-chen's feedback — this PR includes only the new Nextclade pipeline feature (file preservation) without removing the dummy pipeline or organisms.

What's included

  • Nextclade preprocessing: Files attached to submissions are now preserved through the processing pipeline (backend.py, datatypes.py, prepro.py)
  • test-organism-files: Renamed from dummy-organism-with-files with inlined metadata and switched to nextclade preprocessing
  • Integration tests: Updated file-sharing.spec.ts and submission.page.ts for the rename

What's NOT included (deferred to follow-up)

  • Removal of dummy-organism and preprocessing/dummy/
  • Test changes replacing dummy-organism with ebola-sudan
  • Workflow/CI changes removing dummy pipeline image builds
  • Doc updates

This allows the new feature to be validated in production before the old dummy pipeline is dropped.

Test plan

  • Integration tests pass (especially file-sharing.spec.ts)
  • Preprocessing tests pass

🤖 Generated with Claude Code
via Happy

🚀 Preview: Add preview label to enable

Add file preservation to the nextclade preprocessing pipeline so that
files attached to submissions are carried through to the processed output.
This enables the nextclade pipeline to replace the dummy pipeline for
organisms that use file uploads.

Changes:
- datatypes.py: Add `files` field to `UnprocessedData`
- backend.py: Extract files from backend JSON response
- prepro.py: Pass files through to `ProcessedData` in no-alignment path
- values.yaml: Rename `dummy-organism-with-files` to `test-organism-files`
  with inlined metadata and nextclade preprocessing
- Update file-sharing integration tests for the rename

The dummy pipeline and organisms are intentionally kept for now and will
be removed in a follow-up PR after validation.
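
For reviewers, the three Python changes listed above can be pictured roughly as follows. This is a minimal sketch, not the code in this PR: only the names `UnprocessedData`, `ProcessedData`, and the `files` field come from the commit message. The helper names `parse_entry` and `process_without_alignment`, the JSON keys, and the file-reference shape (`fileId`, `name`) are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Any, Optional

# Assumed shape: file category -> list of file references.
# The real schema is not shown in this PR.
FileMap = dict[str, list[dict[str, str]]]


@dataclass
class UnprocessedData:
    metadata: dict[str, Any]
    # New optional field: absent for organisms without file uploads.
    files: Optional[FileMap] = None


@dataclass
class ProcessedData:
    metadata: dict[str, Any]
    files: Optional[FileMap] = None


def parse_entry(raw: dict[str, Any]) -> UnprocessedData:
    """Hypothetical stand-in for the backend.py step: read `files` from the JSON."""
    data = raw.get("data", {})
    return UnprocessedData(
        metadata=data.get("metadata", {}),
        files=data.get("files"),  # None when the backend sends no files
    )


def process_without_alignment(unprocessed: UnprocessedData) -> ProcessedData:
    """Hypothetical stand-in for the no-alignment path in prepro.py."""
    return ProcessedData(
        metadata=dict(unprocessed.metadata),
        files=unprocessed.files,  # carried through unchanged
    )
```

Keeping `files` optional in this sketch means entries from organisms without file uploads are handled the same way as before.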

Generated with [Claude Code](https://claude.ai/code)
via [Happy](https://happy.engineering)

Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Happy <yesreply@happy.engineering>
@claude claude bot added the labels `preprocessing` (Issues related to the preprocessing component) and `deployment` (Code changes targeting the deployment infrastructure) on Feb 19, 2026
