Skip to content

[Feature]: Add workflow for downloading reference genomes #47

@MillironX

Description

@MillironX

Summary

There should be a way to pre-download all of the present genomes from NCBI so that YAVSAP can be run offline.

Added Features

Additional parameters

Propose any new parameters that would be needed to implement this feature,
including its name, type, and default value, and explain how it would be
consumed by the new/existing processes.

  • --download_genomes
    • Switches the pipeline to "genome download" mode, where it will not run any analysis, but will instead simply download and reformat the reference genome files from NCBI.

Additional output

  1. Reference genome fasta/GenBank file corresponding to --genome
  2. Strain reference genome fasta/GenBank file containing all of the sequences from --genome_list
  3. Strain reference genome TSV file that points to 2 in a format YAVSAP can recognize (i.e., from [Feature]: Add ability to use files as a reference genome #44, /data/yavsap/results/genomes.fasta:NC_01234.1).

These files should probably be output even if downloading the files directly in the pipeline (the way it's done now).

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions