Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
172 commits
Select commit Hold shift + click to select a range
e4eea9d
add prefix to singularity container for report
nschan May 22, 2025
e148a95
Merge branch 'nf-core:dev' into dev
nschan May 22, 2025
51281c7
Merge branch 'nf-core:dev' into dev
nschan Jun 2, 2025
5c92346
Merge branch 'nf-core:dev' into dev
nschan Jun 26, 2025
5c836b1
Merge branch 'nf-core:dev' into dev
nschan Jul 1, 2025
268a29b
Merge branch 'nf-core:dev' into dev
nschan Jul 1, 2025
0202358
Merge branch 'nf-core:dev' into dev
nschan Jul 2, 2025
4de24cc
Merge branch 'nf-core:dev' into dev
nschan Jul 11, 2025
4e98303
maxime comments
nschan Jul 11, 2025
376973f
comments Daniel Lundin
nschan Jul 21, 2025
a8e9afc
update release date
nschan Jul 21, 2025
5c5b2d2
Revert "comments Daniel Lundin"
nschan Jul 21, 2025
d343d61
Merge branch 'nf-core:dev' into dev
nschan Oct 24, 2025
4ff031d
Merge branch 'nf-core:dev' into dev
nschan Oct 24, 2025
ce7a5a0
Merge branch 'nf-core:dev' into dev
nschan Feb 13, 2026
484c1d2
prepare pipeline initialization for sample-wise parameterization
nschan Jun 27, 2025
ae122c6
refactor assemble and assemble subworkflows for sample-wise parameter…
nschan Jun 27, 2025
539f309
checks, typos
nschan Jul 1, 2025
f2e0ca7
Important! Template update for nf-core/tools v3.3.1 (#164)
nf-core-bot Jul 1, 2025
5579db4
refactor assemble and assemble subworkflows for sample-wise parameter…
nschan Jun 27, 2025
39b5d38
intermediate WIP commit
nschan Jul 2, 2025
97d2d13
WIP commit
nschan Jul 4, 2025
6b1c1d7
first attemp done
nschan Jul 4, 2025
daee6ee
bugfix commit 1
nschan Jul 7, 2025
2f95c5e
bugfix commit 2
nschan Jul 7, 2025
faddbdb
bugfix commit 3
nschan Jul 7, 2025
d428ddb
report modifications 1; add groups
nschan Jul 7, 2025
cbe5eea
WIP commit
nschan Jul 8, 2025
d24eb8e
WIP commit
nschan Jul 8, 2025
cd198cd
bugfix commit, update configs for sample-wise args
nschan Jul 9, 2025
896617b
running up until including medaka
nschan Jul 9, 2025
54fdedc
running up until including pilon
nschan Jul 9, 2025
a66f51c
running up until including pilon (fixed)
nschan Jul 9, 2025
cd5c8eb
running up until report, schema update
nschan Jul 9, 2025
7ce8e7b
docs update
nschan Jul 9, 2025
a475837
working pipeline (no report
nschan Jul 9, 2025
b789420
adding additional control for merqury, nanoq, for reporting; WIP
nschan Jul 10, 2025
3f26436
refactor assemble and assemble subworkflows for sample-wise parameter…
nschan Jun 27, 2025
0069ed2
Important! Template update for nf-core/tools v3.3.1 (#164)
nf-core-bot Jul 1, 2025
9468b09
params.md
nschan Jul 11, 2025
52708b6
less blocking
nschan Jul 11, 2025
8b4b157
wip commit
nschan Jul 21, 2025
14c129f
use groups in prepare
nschan Jul 29, 2025
e52f31d
cleanup
nschan Jul 29, 2025
eba981c
switch to fastplong
nschan Aug 7, 2025
93d3dd2
fastplong report
nschan Aug 7, 2025
e745fd1
bugfixes and schema update
nschan Aug 8, 2025
406e2e9
report updating
nschan Aug 26, 2025
2dae481
finish refactoring, update report for fastplong
nschan Aug 28, 2025
8508e82
report fixes
nschan Aug 28, 2025
f7125b6
refactor assemble and assemble subworkflows for sample-wise parameter…
nschan Jun 27, 2025
69e94a9
Important! Template update for nf-core/tools v3.3.1 (#164)
nf-core-bot Jul 1, 2025
3b3948e
less blocking
nschan Jul 11, 2025
c3e2ae3
Improve reference input check (#166)
nschan Jun 26, 2025
d7f0d52
Important! Template update for nf-core/tools v3.3.1 (#164)
nf-core-bot Jul 1, 2025
47d656e
prepare pipeline initialization for sample-wise parameterization
nschan Jun 27, 2025
419eed6
refactor assemble and assemble subworkflows for sample-wise parameter…
nschan Jun 27, 2025
65cdac7
Important! Template update for nf-core/tools v3.3.1 (#164)
nf-core-bot Jul 1, 2025
9cc7003
less blocking
nschan Jul 11, 2025
4fadcf1
revert rebase
nschan Aug 28, 2025
575654d
some more conflicts
nschan Aug 28, 2025
f26abb0
reverting changes
nschan Aug 28, 2025
3be0f40
back to working state
nschan Aug 29, 2025
a03842e
docs update
nschan Aug 29, 2025
1f6f2f9
add check if longreads should go into fastplong
nschan Aug 29, 2025
3e528ef
bug collection
nschan Sep 4, 2025
67802f4
remove jellyfish dump
nschan Sep 4, 2025
efb4dba
report grouping fixes
nschan Sep 11, 2025
e324ac9
no more plotly for merqury plots
nschan Sep 11, 2025
79ecc1e
fix report grouping
nschan Sep 19, 2025
3e45625
add comments to main and assemble workflow
nschan Sep 19, 2025
f89fe8f
fix flye-flye scaffolding code
nschan Oct 1, 2025
16f2f22
update report format, refactor document, document format
nschan Oct 23, 2025
26aacd4
update assemble subworkflow: split flye assembly process by read type
nschan Oct 23, 2025
71d8300
remove schema.md
nschan Oct 23, 2025
7cf82b4
minor cleanup
nschan Oct 23, 2025
acffbdc
remove param from quast module
nschan Nov 24, 2025
1c8c7d0
switch to fastp for shortread trimming
nschan Nov 24, 2025
723e865
small update quast SW
nschan Nov 24, 2025
0cc1bae
Bulk change: Channel.emtpy -> channel.empty
nschan Nov 26, 2025
886cecb
switch to fastp, cleanup QC sw
nschan Nov 26, 2025
bcc5fe3
remove old config presets
nschan Nov 26, 2025
b10a791
remove ci.yml
nschan Nov 26, 2025
99cfebf
bulk module update
nschan Dec 9, 2025
3e77ece
fastp config, remove trimgalore
nschan Dec 9, 2025
18f303f
update rocrate
nschan Dec 9, 2025
36c3f54
improve assembler filtering logic
nschan Dec 9, 2025
1697bde
update CHANGELONG
nschan Dec 9, 2025
7df7552
fix validation during pipeline initialisation
nschan Dec 9, 2025
bc6ab4a
update qc read path after fastplong
nschan Dec 9, 2025
926bf41
fix scaffolding inputs
nschan Dec 9, 2025
92711ed
update qc_read_path management during input
nschan Dec 9, 2025
71bdee6
fix links setting
nschan Dec 9, 2025
c12bb98
test snapshot
nschan Dec 10, 2025
803b66f
remove deprecated params from config
nschan Dec 10, 2025
52c5aaf
paths for reads in full_config
nschan Dec 10, 2025
161cc63
update nftignore
nschan Dec 10, 2025
048421c
empty adaptors for fastp
nschan Dec 11, 2025
c9251a9
improve short-read logic
nschan Dec 11, 2025
5fb4b72
wip full_test
nschan Dec 11, 2025
9471bc3
remove fastp TODO
nschan Dec 11, 2025
a7de4d0
improve sample parameterisation
nschan Dec 16, 2025
3b842d5
update param defaults, schema
nschan Jan 13, 2026
2013320
improve channel creation
nschan Jan 13, 2026
35ff8f0
new test config
nschan Jan 13, 2026
0f8d169
report update
nschan Jan 13, 2026
5e6924a
fix bug where samples that had flye in assembler1 and assembler2 were…
nschan Jan 13, 2026
ecc759c
fix typo in channel name
nschan Jan 13, 2026
f8499fd
add busco lineage for test-data
nschan Jan 15, 2026
403e1bb
fix bug where flye samples were not joined back correctly; mix channe…
nschan Jan 15, 2026
6799326
indentation - good, bad and ugly?
nschan Jan 15, 2026
a03f2f4
move all sample information into meta, remove joins that are now obso…
nschan Jan 23, 2026
e9322ee
use csi index, add global param csi_index_size, reported via slack by…
nschan Jan 27, 2026
465b888
finish pipeline refactor towards meta-stuffing
nschan Jan 28, 2026
2a63c88
add dorado polish
nschan Jan 29, 2026
39bd223
integrate dorado into pipeline, add documentation
nschan Jan 29, 2026
ddf00a7
local dorado modules, no test, not properly implemented, dorado not s…
nschan Feb 3, 2026
bb3147d
refactored workflow, finishes full_test successfully
nschan Feb 3, 2026
0e1964c
update report for new busco output
nschan Feb 3, 2026
152a418
some documentation and comments
nschan Feb 3, 2026
3092b2a
add funding information to README
nschan Feb 3, 2026
353a08d
add hic workflow
nschan Feb 4, 2026
e1b8649
fix test
nschan Feb 5, 2026
11cf0cd
config for hic
nschan Feb 10, 2026
e2acf39
documentation
nschan Feb 10, 2026
45a4442
fix issues in handling of HiC data
nschan Feb 10, 2026
7f3c155
add addorreplacereadgroups module
nschan Feb 10, 2026
e01d9d6
update modules
nschan Feb 10, 2026
3d40b0b
update pipeline schema
nschan Feb 10, 2026
9bb1748
update report, sort version issues. rebase after this
nschan Feb 13, 2026
3f06337
update modules, handle topic
nschan Feb 13, 2026
d8582ad
remove ci.yml
nschan Feb 13, 2026
582a936
fix broken merge
nschan Feb 13, 2026
2991bd5
remove trace timestamp
nschan Feb 13, 2026
ccec260
update test
nschan Feb 13, 2026
97105a0
fix nf-test.yml
nschan Feb 13, 2026
88fde2f
fix ext.args for minimap2 going into samtools index
nschan Feb 13, 2026
55a571b
align nf-test.yml with TEMPLATE
nschan Feb 13, 2026
6808b04
tools stuff
nschan Feb 13, 2026
98b4c4d
scope config properly
nschan Feb 16, 2026
20a2797
modules update
nschan Feb 16, 2026
228e35a
change test samplesheet
nschan Feb 16, 2026
96e3fc5
update changelog
nschan Feb 16, 2026
1886f34
switch workflows to versions topic, remove single process subworkflows
nschan Feb 16, 2026
452d475
update modules for versions topic
nschan Feb 17, 2026
22cbd74
clean up versions, fix busco include
nschan Feb 18, 2026
5cc633b
finish transition to versions topic, new test snapshot
nschan Feb 18, 2026
494fd27
linting fixes
nschan Feb 18, 2026
4e3f20a
linting fixes
nschan Feb 18, 2026
1f11419
linting fixes
nschan Feb 18, 2026
c725511
reinstall fastplong without patch
nschan Feb 18, 2026
59e6bb1
more files unchanged
nschan Feb 18, 2026
2c08e3a
fix bug in assembler argument resolution that may affect test
nschan Feb 18, 2026
5c71d57
fix versions in gfa2fa, update test
nschan Feb 18, 2026
7765a61
sort out assembler arg approach update test
nschan Feb 19, 2026
b5949d3
exclude unstable files
nschan Feb 19, 2026
4cf59b8
new metromap
nschan Feb 19, 2026
edef5f6
remove params.md
nschan Feb 19, 2026
43fb04f
replace combine with join in polish_pilon
nschan Feb 19, 2026
df608fa
remove medaka from full_test
nschan Feb 19, 2026
8295789
add test_full_medaka
nschan Feb 19, 2026
47e5892
update readme
nschan Feb 19, 2026
d23f008
update CITATIONS
nschan Feb 20, 2026
83bdff7
update bsky-post action version
nschan Feb 20, 2026
0923ec6
remove NOTE
nschan Feb 20, 2026
5abe752
remove mystery characters
nschan Feb 20, 2026
8257711
fix headers in output.md
nschan Feb 20, 2026
86bca4b
fix indentation in nf-test.config
nschan Feb 20, 2026
651a914
remove debug leftovers
nschan Feb 20, 2026
a8fd65f
update report group input to use group from meta
nschan Feb 20, 2026
aedc9c8
remove additional debug statement
nschan Feb 20, 2026
4f69670
rename scaffold workflow folder, clean up comments
nschan Feb 20, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .github/CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,7 @@ If you wish to contribute a new step, please use the following coding standards:
6. Add sanity checks and validation for all relevant parameters.
7. Perform local tests to validate that the new code works as expected.
8. If applicable, add a new test in the `tests` directory.
9. Add a description of the output files and if relevant any appropriate images from the MultiQC report to `docs/output.md`.

### Default values

Expand Down
95 changes: 0 additions & 95 deletions .github/workflows/ci.yml

This file was deleted.

2 changes: 1 addition & 1 deletion .github/workflows/nf-test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ jobs:
- isMain: false
profile: "singularity"
NXF_VER:
- "25.04.0"
- "25.10.0"
- "latest-everything"
env:
NXF_ANSI_LOG: false
Expand Down
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -9,3 +9,4 @@ testing*
null/
.nf-test/
.nf-test.log
schema.md
5 changes: 4 additions & 1 deletion .nf-core.yml
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,9 @@ lint:
files_unchanged:
- assets/sendmail_template.txt
- .github/CONTRIBUTING.md
- assets/nf-core-genomeassembler_logo_light.png
- docs/images/nf-core-genomeassembler_logo_light.png
- docs/images/nf-core-genomeassembler_logo_dark.png
nf_core_version: 3.5.1
repository_type: pipeline
template:
Expand All @@ -20,4 +23,4 @@ template:
skip_features:
- multiqc
- igenomes
version: 1.1.0
version: 2.0.0dev
2 changes: 0 additions & 2 deletions .prettierignore
Original file line number Diff line number Diff line change
Expand Up @@ -14,5 +14,3 @@ bin/
ro-crate-metadata.json
modules/nf-core/
subworkflows/nf-core/
*.svg
tests/
49 changes: 49 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,53 @@
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## v2.0.0 'Saffron Vulture' - [2026-xx-xx]

This is a major release, with breaking changes.
v2.0.0 of genomeassembler is a large refactor of the pipeline to facilitate sample-level parameteristation. This allows to either parameterise the _pipeline_ using `params`, or parameterise _samples_ via the `input` samplesheet. In case both types of parameterisations are used, sample parameters will take priority.

Since this workflow follows a sample-centric implementation, nextflow will always render the full pipeline dag, but depending on configuration samples may not travel through the whole pipeline. This may also cause terminal output to show task instances that will never become an active process.

In addition, v2.0.0 contains these changes:

### `Added`

- fastplong for long-read trimming and qc
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I know it is a bit difficult to do now, but in general it would be nice to include the PR# along each added feature (and the git handle of the person(s) who worked on it)

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, fully agree. This all a massive PR, sadly there is only one person working on this (me).

- fastp for short-read trimming and qc
- migration to nf-test
- increased flexibility of the scaffolding strategy
- added option to group samples
- `dorado polish` added as an alternative to `medaka` for ONT polishing. This is an **experimental feature**, due to `dorado` being under active development.
- HiC scaffolding subworkflow:
- mapping with `bwamem2` or `minimap2`
- duplicate removal with `picard`
- scaffolding with `yahs`
- Switched to the versions topic, requires nextflow >=25.10.0

### `Fixed`

### `Dependencies`

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you use the nf-core dependencies table for this, along with versions:
example:

| Tool    | Previous version | New version |
| ------- | ---------------- | ----------- |
| multiqc | 1.32             | 1.33        |
| cmaple  | -                | 1.1.0       |

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How do I get this nf-core dependencies table?

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Manual labor xD

- `fastplong`
- `fastp`
- `dorado`
- `bwamem2`
- `picard`
- `yahs`

### `Deprecated`

The following tools are no longer used:

- `nanoq`
- `porechop`
- `lima`
- `trimgalor`

The following param is no longer implemented:

- `dump`, used to dump jellyfish output.

## v1.1.0 'Brass Pigeon' - [2025-07-21]

### `Added`
Expand Down Expand Up @@ -72,3 +119,5 @@ Initial release of nf-core/genomeassembler, created with the [nf-core](https://n
### `Dependencies`

### `Deprecated`

Codenames for v1.x are various types of metallic pigeons, v2.x are vultures of different colors.
30 changes: 19 additions & 11 deletions CITATIONS.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,19 +12,11 @@

### Preprocessing

- [lima](https://github.com/pacificbiosciences/barcoding/)
- [fastp](https://github.com/OpenGene/fastp/) and [fastplong](https://github.com/OpenGene/fastplong/)

- [nanoq](https://github.com/esteinig/nanoq)
> Shifu Chen. 2023. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta 2: e107. https://doi.org/10.1002/imt2.107

> Steinig and Coin (2022). Nanoq: ultra-fast quality control for nanopore reads. Journal of Open Source Software, 7(69), 2991, https://doi.org/10.21105/joss.02991

- [porechop](https://github.com/rrwick/Porechop)

> Wick RR, Judd LM, Gorrie CL, Holt KE. Completing bacterial genome assemblies with multiplex MinION sequencing. Microb Genom. 2017;3(10):e000132. Published 2017 Sep 14. doi:10.1099/mgen.0.000132

- [TrimGalore](https://github.com/FelixKrueger/TrimGalore)

> Felix Krueger, Frankie James, Phil Ewels, Ebrahim Afyounian, Michael Weinstein, Benjamin Schuster-Boeckler, Gert Hulselmans, & sclamons. (2023). FelixKrueger/TrimGalore. Zenodo. https://doi.org/10.5281/zenodo.7598955
> Shifu Chen, Yanqing Zhou, Yaru Chen, Jia Gu; fastp: an ultra-fast all-in-one FASTQ preprocessor, Bioinformatics, Volume 34, Issue 17, 1 September 2018, Pages i884–i890, https://doi.org/10.1093/bioinformatics/bty560

### Assembly

Expand Down Expand Up @@ -52,6 +44,8 @@

- [medaka](https://github.com/nanoporetech/medaka)

- [dorado](https://github.com/nanoporetech/dorado)

### Scaffolding

- [LINKS](https://github.com/bcgsc/LINKS)
Expand All @@ -62,6 +56,10 @@

> Coombe L, Li JX, Lo T, Wong J, Nikolic V, Warren RL and Birol I. LongStitch: high-quality genome assembly correction and scaffolding using long reads. BMC Bioinformatics 22, 534 (2021). https://doi.org/10.1186/s12859-021-04451-7

- [yahs](https://github.com/c-zhou/yahs)

> Chenxi Zhou, Shane A. McCarthy, Richard Durbin. YaHS: yet another Hi-C scaffolding tool. Bioinformatics, 39(1), btac808.

- [RagTag](https://github.com/malonge/RagTag)

> Alonge, Michael, et al. "Automated assembly scaffolding elevates a new tomato system for high-throughput genome editing." Genome Biology (2022). https://doi.org/10.1186/s13059-022-02823-7
Expand Down Expand Up @@ -108,6 +106,16 @@

> Petr Danecek, James K Bonfield, Jennifer Liddle, John Marshall, Valeriu Ohan, Martin O Pollard, Andrew Whitwham, Thomas Keane, Shane A McCarthy, Robert M Davies, Heng Li (2021) Twelve years of SAMtools and BCFtools. GigaScience, Volume 10, Issue 2, February 2021, giab008, https://doi.org/10.1093/gigascience/giab008

- [bwa-mem2](https://github.com/bwa-mem2/bwa-mem2)

> Vasimuddin Md, Sanchit Misra, Heng Li, Srinivas Aluru. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. IEEE Parallel and Distributed Processing Symposium (IPDPS), 2019.

> Li H. (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v2 [q-bio.GN]. (if you use the BWA-MEM algorithm or the fastmap command, or want to cite the whole BWA package)

- [picard](https://broadinstitute.github.io/picard/)

> “Picard Toolkit.” 2019. Broad Institute, GitHub Repository. https://broadinstitute.github.io/picard/; Broad Institute

## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)
Expand Down
33 changes: 11 additions & 22 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@
[![GitHub Actions Linting Status](https://github.com/nf-core/genomeassembler/actions/workflows/linting.yml/badge.svg)](https://github.com/nf-core/genomeassembler/actions/workflows/linting.yml)[![AWS CI](https://img.shields.io/badge/CI%20tests-full%20size-FF9900?labelColor=000000&logo=Amazon%20AWS)](https://nf-co.re/genomeassembler/results)[![Cite with Zenodo](http://img.shields.io/badge/DOI-10.5281/zenodo.XXXXXXX-1073c8?labelColor=000000)](https://doi.org/10.5281/zenodo.14986998)
[![nf-test](https://img.shields.io/badge/unit_tests-nf--test-337ab7.svg)](https://www.nf-test.com)

[![Nextflow](https://img.shields.io/badge/version-%E2%89%A525.04.0-green?style=flat&logo=nextflow&logoColor=white&color=%230DC09D&link=https%3A%2F%2Fnextflow.io)](https://www.nextflow.io/)
[![Nextflow](https://img.shields.io/badge/version-%E2%89%A525.10.0-green?style=flat&logo=nextflow&logoColor=white&color=%230DC09D&link=https%3A%2F%2Fnextflow.io)](https://www.nextflow.io/)
[![nf-core template version](https://img.shields.io/badge/nf--core_template-3.5.1-green?style=flat&logo=nfcore&logoColor=white&color=%2324B064&link=https%3A%2F%2Fnf-co.re)](https://github.com/nf-core/tools/releases/tag/3.5.1)
[![run with conda](http://img.shields.io/badge/run%20with-conda-3EB049?labelColor=000000&logo=anaconda)](https://docs.conda.io/en/latest/)
[![run with docker](https://img.shields.io/badge/run%20with-docker-0db7ed?labelColor=000000&logo=docker)](https://www.docker.com/)
Expand All @@ -21,44 +21,31 @@

## Introduction

**nf-core/genomeassembler** is a bioinformatics pipeline that carries out genome assembly, polishing and scaffolding from long reads (ONT or pacbio). Assembly can be done via `flye` or `hifiasm`, polishing can be carried out with `medaka` (ONT), or `pilon` (requires short-reads), and scaffolding can be done using `LINKS`, `Longstitch`, or `RagTag` (if a reference is available). Quality control includes `BUSCO`, `QUAST` and `merqury` (requires short-reads).
Currently, this pipeline does not implement phasing of polyploid genomes or HiC scaffolding.
**nf-core/genomeassembler** is a bioinformatics pipeline that carries out genome assembly, polishing and scaffolding from long reads (ONT or pacbio). Assembly can be done via `flye` or `hifiasm`, or combinations of both, polishing can be carried out with `medaka` (ONT), `dorado` (ONT only, experimental) or `pilon` (requires short-reads), and scaffolding can be done using `LINKS`, `Longstitch`, both using long-reads, `yahs` if HiC reads are availble, or `RagTag` if a reference is available. Quality control includes `BUSCO`, `QUAST` and `merqury` (requires short-reads).
Currently, this pipeline does not implement phasing of polyploid genomes.

<picture>
<source media="(prefers-color-scheme: dark)" srcset="docs/images/genomeassembler.dark.png">
<img alt="nf-core/genomeassembler" src="docs/images/genomeassembler.light.png">
<img alt="nf-core/genomeassembler" src="docs/images/genomeassembler_v2.light.png">
</picture>

## Usage

> [!NOTE]
> If you are new to Nextflow and nf-core, please refer to [this page](https://nf-co.re/docs/usage/installation) on how to set-up Nextflow. Make sure to [test your setup](https://nf-co.re/docs/usage/introduction#how-to-run-a-pipeline) with `-profile test` before running the workflow on actual data.

First, prepare a samplesheet with your input data that looks as follows:
nf-core/genomeassembler can be set up via pipeline parameters, or via a samplesheet, or a combination of both. For more details and further functionality, please refer to the [usage documentation](https://nf-co.re/genomeassembler/usage) and the [parameter documentation](https://nf-co.re/genomeassembler/parameters).

`samplesheet.csv`:

```csv
sample,ontreads,hifireads,ref_fasta,ref_gff,shortread_F,shortread_R,paired
sampleName,ontreads.fa.gz,hifireads.fa.gz,assembly.fasta.gz,reference.fasta,reference.gff,short_F1.fastq,short_F2.fastq,true
```

Each row represents one genome to be assembled. `sample` should contain the name of the sample, `ontreads` should contain a path to ONT reads (fastq.gz), `hifireads` a path to HiFi reads (fastq.gz), `ref_fasta` and `ref_gff` contain reference genome fasta and annotations. `shortread_F` and `shortread_R` contain paths to short-read data, `paired` indicates if short-reads are paired. Columns can be omitted if they contain no data, with the exception of `shortread_R`, which needs to be present if `shortread_F` is there, even if it is empty.

Now, you can run the pipeline using:
The pipeline can be run with a test-profile via:

```bash
nextflow run nf-core/genomeassembler \
-profile <docker/singularity/.../institute> \
--input samplesheet.csv \
--outdir <OUTDIR>
-profile test,<docker/singularity/.../institute> \
```

> [!WARNING]
> Please provide pipeline parameters via the CLI or Nextflow `-params-file` option. Custom config files including those provided by the `-c` Nextflow option can be used to provide any configuration _**except for parameters**_; see [docs](https://nf-co.re/docs/usage/getting_started/configuration#custom-configuration-files).

For more details and further functionality, please refer to the [usage documentation](https://nf-co.re/genomeassembler/usage) and the [parameter documentation](https://nf-co.re/genomeassembler/parameters).

## Pipeline output

To see the results of an example test run with a full size dataset refer to the [results](https://nf-co.re/genomeassembler/results) tab on the nf-core website pipeline page.
Expand All @@ -67,12 +54,14 @@ For more details about the output files and reports, please refer to the

## Credits

nf-core/genomeassembler was originally written by [Niklas Schandry](https://github.com/nschan), of the Faculty of Biology of the Ludwig-Maximilians University (LMU) in Munich, Germany.
nf-core/genomeassembler was written, and is currently maintained by [Niklas Schandry](https://github.com/nschan), of the Faculty of Biology of the Ludwig-Maximilians University (LMU) in Munich, Germany, with funding support from the German Research Foundation (Deutsche Forschungsgemeinschaft [DFG], via Transregional Research Center TRR356 grant 491090170-A05 to Niklas Schandry).

I thank the following people for their extensive assistance and constructive reviews during the development of this pipeline:
I thank the following people for constructive reviews and discussion during the development of this pipeline:

- [Jim Downie](https://github.com/prototaxites)
- [Mahesh Binzer-Panchal](https://github.com/mahesh-panchal)
- [Matthias Hörtenhuber](https://github.com/mashehu)
- [Evangelos Karatzas](https://github.com/vagkaratzas)
- [Louis Le Nézet](https://github.com/LouisLeNezet)
- [Júlia Mir Pedrol](https://github.com/mirpedrol)
- [Daniel Straub](https://github.com/d4straub)
Expand Down
Binary file modified assets/nf-core-genomeassembler_logo_light.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
38 changes: 38 additions & 0 deletions assets/report/_brand.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
color:
palette:
green: "#24B064"
brown: "#3F2B29"
yellow: "#ECDC86"
white: "#f9f9f9"
gray-dark: "#343a40"
gray-100: "#f8f9fa"
gray-200: "#e9ecef"
gray-300: "#dee2e6"
gray-400: "#ced4da"
gray-500: "#adb5bd"
gray-600: "#6c757d"
gray-700: "#495057"
gray-800: "#343a40"
gray-900: "#212529"
background: white
foreground: gray-dark
primary: green
secondary: yellow
tertiary: brown
light: "#f8f9fa"
dark: "#212529"

logo:
images:
banner: "nf-core-logo.png"
apple: "nf-core-logo-square.png"
small: apple
medium: banner
large: banner

typography:
fonts:
- family: Maven Pro
source: google
base: Maven Pro
headings: Maven Pro
Loading
Loading