- Clone the repo from GitHub.
git clone https://github.com/zellerlab/vortex_light.git
- Create a conda environment with NextFlow, e.g. by using the provided
environment.yml.
cd vortex_light
conda env create -f environment.yml
conda activate vortex_light
-
Make a copy of the
config/run.configfile and adjust it to your environment. -
Run the pipeline
nextflow run /path/to/vortex_light/main.nf --input_dir /path/to/input_files --output_dir /path/to/output_dir -c /path/to/run.config
Note: Nextflow itself requires at least 5GB of memory.
This requires a local nextflow installation. If you don't have one, see Steps 1/2 above.
-
Make a local copy of the run configuration file and adjust to your environment.
-
Run the pipeline
nextflow run zellerlab/vortex_light --input_dir /path/to/input_files --output_dir /path/to/output_dir -c /path/to/run.config
Note: Nextflow itself requires at least 5GB of memory.
--input_dirshould be a folder with bam files or with gzipped fastq files. For fastq files, individual samples should be separated into individual folders.--output_dirisvlight_outin the local directory by default.--skip_<analysis>,--run_<analysis>skips, resp. explicitly requires execution of the specified analysis (pathseq,base_counts(read counts post pre-processing),kraken2)--publishModeallows to switch between various modes of how results files are placed in theoutput_dir(cf. NextFlow documentation)
kraken2can only run when the parameterkraken_databaseis set.pathseqcan only run when the parameterpathseq_databaseis set.
- Outputs
The output folder contains:
- a subdirectory per sample (named
<sample>) with- the kraken2 report
<sample>.kraken2_report.txt - the library size
<sample>.libsize.txt - pathseq output
<sample>.pathseq.bam<sample>.pathseq.bam.sgi<sample>.pathseq.score_metrics<sample>.pathseq.scores
- the kraken2 report
Note that by default, all files in the output folder are symlinks into the work dir! Before you delete the work dir, ensure you have dereferenced copies. Alternatively, change the --publishMode parameter to copy or link (if the target file system supports hard links).