The format is based on Keep a Changelog and this project adheres to Semantic Versioning.
[3.23.0] - 2026-02-27
Special thanks to the following for their contributions to the release:
- Albert Palleja
- Chase Mateusiak
- Christian Mertes
- Elad Herz
- Ezra Greenberg
- Gary Burnett
- Isaac Virshup
- Juliana Assis
- Marine Cambon
- Matthias Zepper
- Sebastian Schulz
- PR #1616 - Add Sylph for contamination detection.
- PR #1663 - Bump version after release 3.22.2
- PR #1664 - Add support for multiple rRNA removal tools (
--ribo_removal_tool): SortMeRNA (default), Bowtie2, and RiboDetector; enable BBSplit MultiQC reporting; add paired-end read grouping in MultiQC - PR #1665 - Bulk update modules/subworkflows; replace CUSTOM_GETCHROMSIZES with SAMTOOLS_FAIDX; update RSEQC modules with Wave containers for ARM compatibility; update ARM containers for RSEQC and UMITOOLS
- PR #1667 - Enhance RSEM output exports and remove unnecessary star_rsem check
- PR #1669 - Enable SeqKit stats MultiQC module for RiboDetector rRNA removal
- PR #1672 - Document star_rsem STAR aligner settings and customization
- PR #1677 - Apply Nextflow 25 strict syntax fixes
- PR #1685 - Add GPU-accelerated STAR alignment and mark duplicates using NVIDIA Parabricks rna_fq2bam (
--use_parabricks_star) - PR #1686 - Add support for uncompressed FASTQ input files (#1343)
- PR #1687 - Fix PREPARE_GENOME tests: use single aligner values, correct aligner assignments for index-specific tests, add Kallisto test, remove ineffective Sentieon tests, use file-names-only snapshots for all non-deterministic indices (STAR, Salmon, Kallisto, RSEM, HISAT2)
- PR #1688 - Significantly improved prokaryotic RNA-seq support: new
-profile prokaryoticfor bacterial/archaeal data with Bowtie2+Salmon alignment, GFFREAD transcript extraction for CDS-only annotations, and automatic STAR CDS configuration. We would like to thank Juliana Assis, Sebastian Schulz, Albert Palleja and Marine Cambon for their recommendations and extensive testing. - PR #1689 - Migrate to topic-based version reporting for nf-core modules/subworkflows and local modules
- PR #1694 - Replace local iGenomes STAR modules with nf-core module aliases and config-based container overrides
- PR #1696 - Fix offline mode by skipping
igenomes_basevalidation whenNXF_OFFLINE=true(#1690) - PR #1697 - Add tximport processing for RSEM outputs: gene/transcript length matrices, length-scaled counts, SummarizedExperiment objects, and DESeq2 QC using length-scaled counts (#1320). Note for
--aligner star_rsemusers: the merged count and TPM files at the output root (e.g.rsem.merged.gene_counts.tsv) are now produced by tximport, matching the Salmon/Kallisto pathway. The previous RSEM merge script outputs are preserved in thersem_merge_counts/subdirectory. - PR #1700 - Add FastQC step after BBSplit/rRNA filtering to provide QC metrics on reads used for alignment (#1276)
- PR #1704 - Include single-library samples in merged fastq output when
--save_merged_fastqis set (#748) - PR #1707 - Enable UMI deduplication with
--aligner star_rsem(#829) - PR #1708 - Allow
--skip_alignmentwith a pre-built--salmon_indexor--kallisto_indexwithout requiring--fasta(#1706) - PR #1709 - Update metro map renders with nf-metro v0.4.7; add source
.mmdfile and regeneration instructions to CONTRIBUTING.md - PR #1711 - Scope GPU container flags (
--gpus all,--nv) toprocess_gputasks only via per-processcontainerOptions, fixing failures on CPU-only nodes in mixed clusters (#1710) - PR #1713 - Add optional
seq_platformandseq_centersamplesheet columns for per-sample BAM read group tags (PL,CN). Read group assembly is handled via ext.args config closures rather than module inputs. Per-sampleseq_centeroverrides the global--seq_centerparameter. Based on PR #1701 by @c-mertes. - PR #1714 - Re-render metro map with nf-metro v0.5.2: improved label spacing, smaller file icon font, and section alignment fix
- PR #1715 - Bump version to 3.23.0 ahead of release
- PR #1716 - Add informative error when Salmon fails to produce strandedness output for auto-strandedness samples
- PR #1718 - Add
--seq_platformparameter for global sequencing platform BAM read group tag, by analogy with--seq_center - PR #1719 - Docs and changelog cleanup: move descriptions above
<details>blocks in output.md, add missing software dependencies, sort arm.config alphabetically - PR #1724 - Update umitools modules: remove patches, add conda channel prefixes, update Wave containers
- PR #1725 - Refine .nftignore patterns to reinstate tx2gene MD5 checking, remove redundant UNTAR version collection
- PR #1728 - Re-render metro map with nf-metro v0.5.4: bolder section labels and number badges for improved visual hierarchy, increased vertical spacing between stacked sections, synchronized animation timing
- PR #1730 - Bump Nextflow from 25.04.0 to 25.04.3 in CI to fix
conda create --mkdirfailure (nextflow-io/nextflow#5947) - PR #1734 - Compose
test_prokaryoticprofile onprokaryoticto remove duplicated settings - PR #1735 - Code quality improvements: replace opaque tuple indexing with named destructuring, rename
ch_dummy_file, move metro map docs, align GPU CI workflow with other workflows - PR #1736 - Load prokaryotic config in nf-test instead of duplicating params in each test
- PR #1738 - Fix iGenomes STAR conda: use
biocondachannel for x86_64, reserveseqerachannel for ARM only
| Old parameter | New parameter |
|---|---|
--ribo_removal_tool |
|
--sylph_db |
|
--sylph_taxonomy |
|
--gffread_transcript_fasta |
|
--extra_bowtie2_align_args |
|
--use_parabricks_star |
NB: Parameter has been added if just the new parameter information is present.
| Profile | Description |
|---|---|
prokaryotic |
Settings optimized for bacterial/archaeal RNA-seq data |
| Dependency | Old version | New version |
|---|---|---|
bowtie2 |
2.5.4 | |
seqkit |
2.9.0 | |
sylph |
0.7.0 | |
sylph-tax |
1.2.0 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.22.2] - 2025-12-11
Special thanks to the following for their contributions to the release:
- PR #1654 - Fix tximport to handle tx2gene files with extra columns from
--gtf_extra_attributes, and fix sample name mangling in DESeq2 QC - PR #1655 - Fix duplicate flagstat files in MultiQC report when mark duplicates is enabled (#1653)
- PR #1656 - Bump version after release 3.22.1
- PR #1659 - Fix MultiQC sample name collisions when multiple samples share the same FASTQ filename (#1657)
- PR #1658 - Bump nf-core/multiqc module to 1.33
- PR #1660 - Update MultiQC ARM container to 1.33
- PR #1648 - Reduce duplicate coverage in pipeline tests by adding skip options
- PR #1661 - Bump version to 3.22.2 ahead of release
[3.22.1] - 2025-12-04
Special thanks to the following for their contributions to the release:
- PR #1640 - Bump version after release 3.22.0
- PR #1641 - Add arm-based CI tests and fix arm-related issues
- PR #1645 - Fix BAM CSI index access error with UMI deduplication (#1643)
- PR #1642 - Add long format to rsem merge
- PR #1650 - Restrict ARM CI tests to Docker profile only
- PR #1651 - Add dupMatrix files to nftignore to fix conda reproducibility issues
[3.22.0] - 2025-11-26
Special thanks to the following for their contributions to the release:
- Ahwan Pandey
- Cristina Tuñí i Domínguez
- Elad Herz
- Emily Miyoshi
- Jonathan Manning
- Pontus Höjer
- Siddhartha Bagaria
- PR #1608 - Bump version after release 3.21.0
- PR #1613 - Fix broken link and add latest kit version for Takara UMI prep in usage documentation
- PR #1614 - Template update for nf-core/tools v3.4.1
- PR #1617 - Update bbmap/bbsplit module
- PR #1618 - Fix CI: Ensure confirm-pass job runs for markdown-only PRs
- PR #1619 - Update Credits to reflect current maintainership
- PR #1620 - Fix bigwig strand labeling for reverse-stranded libraries (#1591)
- PR #1621 - Optimize qualimap performance with multi-threaded name sorting
- PR #1622 - Update tximeta/tximport module to fix sample name mangling
- PR #1624 - Document RSeQC inner_distance limitation for genomes with large chromosomes (>500 Mb), such as plant genomes
- PR #1625 - Add documentation warning about Qualimap read counting bug (#1273)
- PR #1628 - Template update for nf-core/tools v3.5.1
- PR #1630 - Fix arm64 profile to use pre-built ARM containers and update documentation
- PR #1631 - Fix bbsplit index staging by using symlinks instead of full copy
- PR #1632 - Add validation error for incompatible
--transcript_fastaand--additional_fastaparams (#1450) - PR #1635 - Fix
--gtf_extra_attributesto support multiple comma-separated values and correct deprecated parameter name in docs (#1626) - PR #1636 - Simplify workflow nextflow.config by consolidating redundant patterns
- PR #1638 - Bump version to 3.22.0 ahead of release
[3.21.0] - 2025-09-18
Special thanks to the following for their contributions to the release:
- PR #1597 - Bump version after release 3.20.0
- PR #1603 - Add bam input pathway
- PR #1604 - Enable BAM input for RSEM
- PR #1605 - Fix default for umi_discard_read to prevent validation errors in Platform
- PR #1606 - Bump version to 3.21.0 ahead of release
| Dependency | Old version | New version |
|---|---|---|
MultiQC |
1.30 | 1.31 |
[3.20.0] - 2025-08-18
Special thanks to the following for their contributions to the release:
- PR #1568 - Bump version after release 3.19.0
- PR #1571 - For umitools use only umi_dedup.sorted.log in multiqc_report
- PR #1573 - Fix salmon.merged.SummarizedExperiment.rds name collision
- PR #1585 - Update awsfulltest.yml to restore aligner-wise outputs
- PR #1580 - Template update for nf-core/tools v3.3.2
- PR #1590 - Addition of Sentieon STAR
- PR #1594 - Exclude star rsem pca from snaps
- PR #1595 - Exclude unstable star_rsem clusterings from snaps
| Dependency | Old version | New version |
|---|---|---|
MultiQC |
1.29 | 1.30 |
Sentieon |
202503.01 |
[3.19.0] - 2025-06-10
Special thanks to the following for their contributions to the release:
- Alena Nikolaeva
- Anabella Trigila
- Angel Pizarro
- Ben Sherman
- Dave Carlson
- Gabriel Lichtenstein
- Jonathan Manning
- Lorenzo Fontana
- Matthias Hörtenhuber
- Milos Micik
- Maxime Garcia
- PR #1480 - Bump version after release 3.18.0
- PR #1482 - Update trimgalore module for save_unpaired fix
- PR #1486 - Bump STAR build for multiprocessing fix
- PR #1490 - Make genomic FASTA input optional
- PR #1496 - Template update for nf-core/tools v3.2.0
- PR #1499 - Bump MultiQC module to 1.27
- PR #1508 - Fix missing Bracken results in the MultiQC report
- PR #1528 - Improve JSON schema validation files
- PR #1523 - Update preprocessing subworkflow to fix linting block on trimming
- PR #1521 - Updated Perl conda package version for local module gtf2bed for Arm compatibility.
- PR #1550 - Simplify
SummarizedExperimentoutputs. - PR #1553 - Make jobs automatically resubmit for exit code 175
- PR #1556 - Update index docs to remove references to 'indexing only mode'
- PR #1558 - Remove patches and bump modules for ARM compat
- PR #1560 - General modules update, include updating subworkflows for stats fixes
- PR #1563 - Version bumps ahead of 3.19.0 release
- PR #1565 - Improve reproducibility with Conda
- PR #1567 - Prerelease 3.19.0 fixes
[3.18.0] - 2024-12-19
Special thanks to the following for their contributions to the release:
- PR #1369 - Add umicollapse as an alternative to umi-tools
- PR #1461 - Add FASTQ linting during preprocessing
- PR #1463 - Move channel operations outside of the onComplete() block
- PR #1467 - Add test suite for UMI handling functionality
- PR #1466 - Factor out UMI handling
- PR #1470 - Update subworkflow to account for fix to bad argument handling
- PR #1469 - Minor docs fix
- PR #1459 - Remove reference to unused "skip_sample_count" value in email templates
- PR #1471 - Fix prepare_genome subworkflow for sortmerna
- PR #1473 - Bump STAR modules
- PR #1474 - Bump versions to 3.18.0
- PR #1475 - Fix log publishing around umitools/ umicollapse
- PR #1447 - Add tutorial series for analysing count data
- PR #1518 - Remove duplicated text in README.md
| Old parameter | New parameter |
|---|---|
--skip_linting |
|
--extra_fqlint_args |
|
--umi_dedup_tool |
| Dependency | Old version | New version |
|---|---|---|
UMICollapse |
1.1.0 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.17.0] - 2024-10-23
Special thanks to the following for their contributions to the release:
- PR #1418 - Bump versions back to 3.17.0 after release
- PR #1422 - Bump lots of modules so that conda versions have ARM builds
- PR #1423 - Bump STAR version for version with ARM Conda build
- PR #1424 - Patch sortmerna to 4.3.7 for ARM compatibility
- PR #1425 - Add profile for ARM compatibility
- PR #1432 - Bump versions for 3.17.0 release
- PR #1434 - Default registry is quay.io for all containers
- PR #1437 - Update software dependencies
- PR #1437 - Remove conda from test matrix
- PR #1440 - Further reduce matrix
- PR #1441 - Force some containers to be converted from docker to singularity to fix nf-core download tests
| Dependency | Old version | New version |
|---|---|---|
bbmap |
39.01 | 39.10 |
coreutils |
8.30 | 9.5 |
fq |
0.9.1 | 0.12.0 |
python (catadditionalfasta) |
3.9.5 | 3.12.2 |
python (tx2gene) |
3.9.5 | 3.10.4 |
cutadapt |
3.4 | 4.9 |
htslib (star) |
1.18 | 1.20 |
htslib |
1.20 | 1.21 |
kallisto |
0.48.0 | 0.51.1 |
preseq |
3.1.2 | 3.2.0 |
salmon |
1.10.1 | 1.10.3 |
samtools (star) |
1.18 | 1.20 |
samtools |
1.20 | 1.21 |
sortmerna |
4.3.6 | 4.3.7 |
star |
2.7.10a | =2.7.11b |
stringtie |
2.2.1 | 2.2.3 |
subread |
2.0.1 | 2.0.6 |
trim-galore |
0.6.7 | 0.6.10 |
ucsc-bedgraphtobigwig |
445 | 469 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.16.1] - 2024-10-16
Special thanks to the following for their contributions to the release:
- PR #1398 - Bump pipeline version to 3.17.0dev
- PR #1401 - Template update for nf-core/tools v3.0.1
- PR #1405 - Fix bad variable name in subworkflow
- PR #1406 - Keep only one samplesheetToList
- PR #1409 - Fix manifest DOI text
- PR #1410 - Fix issues caused by empty versions from trimming subworkflows
- PR #1412 - Reset versions back to 3.16.1 for patch release
- PR #1415 - Disable pipeline level conda tests
- PR #1416 - Really disable pipeline level conda tests
| Old parameter | New parameter |
|---|---|
--help_full |
|
--show_hidden |
|
--validationFailUnrecognisedParams |
|
--validationLenientMode |
|
--validationSchemaIgnoreParams |
|
--validationShowHiddenParams |
NB: Parameter has been updated if both old and new parameter information is present.
NB: Parameter has been added if just the new parameter information is present.
NB: Parameter has been removed if new parameter information isn't present.
| Dependency | Old version | New version |
|---|---|---|
MultiQC |
1.24.1 | 1.25.1 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.16.0] - 2024-10-02
Special thanks to the following for their contributions to the release:
- Adam Talbot
- Ezra Greenberg
- Jonathan Manning
- Matthias Zepper
- Maxime Garcia
- Paolo Di Tommaso
- Phil Ewels
- Shaun Regenbaum
- PR #1376 - Fix invalid named parameter syntax
- PR #1386 - Bump pipeline version to 3.16.0dev
- PR #1388 - Adding Kraken2/Bracken on unaligned reads as an additional quality control step to detect sample contamination
- PR #1389 - Update animated subway map
- PR #1393 - Use quay.io for bedtools/genomecov to solve issue with nf-core download
| Old parameter | New parameter |
|---|---|
--contaminant_screening |
|
--kraken_db |
|
--save_kraken_assignments |
|
--save_kraken_unassigned |
|
--bracken_precision |
NB: Parameter has been updated if both old and new parameter information is present.
NB: Parameter has been added if just the new parameter information is present.
NB: Parameter has been removed if new parameter information isn't present.
| Dependency | Old version | New version |
|---|---|---|
Kraken2 |
2.1.3 | |
Bracken |
2.9 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.15.1] - 2024-09-16
- PR #1374 - Bump pipeline version to 3.16.0dev
- PR #1379 - Enhance pipeline level test snapshots
- PR #1380 - Fix issues with R modules changing sample names
- PR #1381 - Update all modules following massive conda usage update in nf-core modules
- PR #1382 - Slight fixes for rnaseq preprocessing
- P$ #1383 - Prerelease 3.15.1 version bumps
[3.15.0] - 2024-09-04
Special thanks to the following for their contributions to the release:
- Adam Talbot
- David Carlson
- Edmund Miller
- Jonathan Manning
- Laramie Lindsey
- Luke Zappia
- Matthias Zepper
- Maxime Garcia
- Pieter Moris
- Rob Syme
- Thomas Danhorn
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- PR #1180 - Bump pipeline version to 3.15.0dev
- PR #1186 - Properly update qualimap/rnaseq module (ie not patch)
- PR #1197 - Delete lib directory and replace with utils_* subworkflows
- PR #1199 - Replace modules.config with more modular config files per module/subworkflow/workflow
- PR #1201 - Template update for nf-core/tools v2.12
- PR #1206 - Remove
lib/directory andmodules.config - PR #1210 - Use pseudoalignment subworkflow components from nf-core/modules
- PR #1212 - Update all modules and subworkflows to latest versions
- PR #1213 - Pass transcriptome fasta through to samtools stats
- PR #1214 - Bump umitools + delocalise prepareforrsem (#831)
- PR #1216 - Delocalise catadditionalfasta (#1162)
- PR #1217 - Update Emiller88 => edmundmiller in README
- PR #1218 - Template update for nf-core/tools v2.13
- PR #1220 - Initialise nf-test and add pipeline level test
- PR #1221 - Use nf-test test for all nf-core components
- PR #1226 - Reuse bbsplit index and don't keep overwriting (#1225)
- PR #1228 - Make README usage consistent with docs/usage.md
- PR #1229 - Template update for nf-core/tools v2.13.1
- PR #1231 - Add sortmerna index possibilities
- PR #1232 - Add nf-test tests to star_genomegenerate_igenomes
- PR #1233 - Add nf-test tests to star_align_igenomes
- PR #1234 - Use genomecov from nf-core/modules
- PR #1235 - Add nf-test tests to utils_nfcore_rnaseq_pipeline tests
- PR #1236 - Add nf-test tests to gtf_filter
- PR #1237 - Fix concurrency error in Github CI workflow
- PR #1238 - Add nf-test tests to preprocess_transcripts_fasta_gencode
- PR #1239 - Add nf-test tests to align_star
- PR #1240 - Fix reference files params usage
- PR #1241 - Add nf-test tests to deseq2_qc
- PR #1242 - Use dupradar from nf-core/modules
- PR #1243 - Add nf-test for module MULTIQC_CUSTOM_BIOTYPE
- PR #1244 - Add gtf2bed tests
- PR #1245 - nf test quantify rsem
- PR #1246 - nf-test quantify pseudoalignment
- PR #1247 - nf-test prepare_genome
- PR #1248 - Improved ext.args consolidation for STAR and TRIMGALORE extra_args parameters
- PR #1249 - Include nf-tests for rsem_merge_counts module
- PR #1250 - Remove all tags.yml files because the testing system has changed
- PR #1251 - Replace deseq2qc paths
- PR #1252 - Fix genomeAttribute usage
- PR #1253 - Use nf-test files as matrix to test over in CI/CD for efficiency
- PR #1260 - Update CHANGELOG
- PR #1261 - Add more tests for PREPARE_GENOME
- PR #1262 - Fix CI pipeline
- PR #1264 - Add GHA files back into include statement of Github workflow change detection
- PR #1265 - Small updates noticed during code review
- PR #1266 - Delete unecessary tags from nf.test files for modules and subworkflows
- PR #1271 - Update trimming subworkflow to include more tests
- PR #1272 - Simple pipeline level nf-tests
- PR #1274 - Update bam_markduplicates_picard subworkflow
- PR #1278 - Delocalise pseudo quant workflow
- PR #1279 - Add pseudoaligner pipeline level tests to test suite
- PR #1280 - Reorganise pipeline level tests into flat directory structure
- PR #1282 - Fix CHANGELOG error
- PR #1283 - Add output files to nf-test snapshot
- PR #1293 - Update subworkflow utils_nfcore_pipeline
- PR #1297 - Important! Template update for nf-core/tools v2.14.1
- PR #1302 - Add missing files from Tximport processing
- PR #1304 - Remove redundant gene TPM outputs
- PR #1306 - Overhaul strandedness detection / comparison
- PR #1307 - Clarify infer strandedness step in subway map and text
- PR #1308 - Various MultiQC issues: FastQC sections for raw and trimmed reads // umi-tools dedup and extraction plots, custom content styling.
- PR #1309 - Document FASTP sampling
- PR #1310 - Reinstate pseudoalignment subworkflow config
- PR #1312 - Fix issues with unzipping of GTF/ GFF files without absolute paths
- PR #1314 - Add reference genome recommendations to usage docs
- PR #1317 - Strip problematic ifEmpty()
- PR #1319 - Reinstate oncomplete error messages
- PR #1321 - Remove push and release triggers from CI
- PR #1322 - Use pre-built Github Action to detect nf-test changes
- PR #1323 - Update actions/checkout to v4
- PR #1324 - Fix tags entries and rename pipeline level tests
- PR #1325 - Minor fixes to strandedness settings and messaging
- PR #1326 - Move Conda dependencies for local modules to individual environment file
- PR #1328 - Add pipeline level test for STAR-RSEM and HISAT2
- PR #1329 - Remove tags from all nf-test files
- PR #1330 - Update all nf-core/modules and subworkflows
- PR #1331 - Adding stubs for local modules
- PR #1334 - Update all nf-core/modules and subworkflows with stubs
- PR #1335 - Adding stubs at all levels
- PR #1336 - Use nf-core/setup-nf-test to install nf-test from cache during CI/CD
- PR #1340 - Remove out-of-date Azure specific guidance
- PR #1341 - Add rename in the MultiQC report for samples without techreps
- PR #1342 - Factor out preprocessing
- PR #1345 - Fix preprocessing call
- PR #1350 - Reduce resource usage for sort process in bedtools/genomecov
- PR #1352 - Assorted fixes to MultiQC usage
- PR #1353 - Correct conditional for salmon indexing in preprocessing workflow
- PR #1355 - Make all curves on subway map better looking, and all lines now have the same width
- PR #1357 - Fix anchor issue in multiqc
- PR #1358 - Update test profiles to restore a static URI for megatests
- PR #1359 - Update MultiQC and revert unnecessary workaround
- PR #1360 - More complete snapshots for pipeline level tests
- PR #1361 - Animate subway map
- PR #1362 - Move multiqc module prefix for nf-test to module
- PR #1363 - Minor updates of nf-core modules and subworkflows
- PR #1363 - Update dupradar script
- PR #1366 - Clarify docs on different tximport count files
- PR #1367 - Clarify design formula and blind dispersion estimation
- PR #1370 - Bump versions for 3.15.0
- PR #1371 - Apply Maxime's CHANGELOG edits
- PR #1372 - Bump tximeta/tximport for gene table row names fix
| Old parameter | New parameter |
|---|---|
--pipelines_testdata_base_path |
|
--sortmerna_index |
|
--stranded_threshold |
|
--unstranded_threshold |
|
--test_data_base |
| Dependency | Old version | New version |
|---|---|---|
bedtools |
2.30.0 | 2.31.1 |
bioconductor-dupradar |
1.28.0 | 1.32.0 |
bioconductor-summarizedexperiment |
1.24.0 | 1.32.0 |
bioconductor-tximeta |
1.12.0 | 1.20.1 |
gffread |
0.12.1 | 0.12.7 |
multiqc |
1.20 | 1.22.3 |
picard |
3.0.0 | 3.1.1 |
samtools |
1.17 | 1.20 |
sortmerna |
4.3.4 | 4.3.6 |
umi_tools |
1.14 | 1.15 |
untar |
1.3 | 1.34 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.14.0] - 2024-01-08
Special thanks to the following for their contributions to the release:
- Adam Talbot
- Jonathan Manning
- Mahesh Binzer-Panchal
- Matthias Zepper
- Maxime Garcia
- Phil Ewels
- Vlad Savelyev
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- PR #1135 - Update action-tower-launch to v2 which supports more variable handling
- PR #1141 - Important! Template update for nf-core/tools v2.11
- PR #1143 - Move fasta check back to Groovy (#1142)
- PR #1144 - Interface to kmer size for pseudoaligners (#1111)
- PR #1149 - Fix and patch version commands for Fastp, FastQC and UMI-tools modules (#1103)
- PR #1150 - Be more flexible on attribute values in GTFs (#1132)
- PR #1151 - fix to #1150: reinstate conditional
- PR #1152 - Bump container versions for tools using Docker V1 manifest (#1140)
- PR #1154 - Prerelease 3.14.0 fixes (#1111, #1153)
- PR #1157 - Add slash to
--outdirfor cloud tests to fix Azure validation issue - PR #1159 - Issues loading MultiQC report (#1158)
| Old parameter | New parameter |
|---|---|
--pseudo_aligner_kmer_size |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if new parameter information isn't present.
| Dependency | Old version | New version |
|---|---|---|
multiqc |
1.17 | 1.19 |
qualimap |
2.2.2d | 2.3 |
rseqc |
3.0.1 | 5.0.3 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.13.2] - 2023-11-21
Special thanks to the following for their contributions to the release:
- PR #1123 - Overhaul tximport.r, output length tables
- PR #1124 - Ensure pseudoaligner is set if pseudoalignment is not skipped
- PR #1126 - Pipeline fails if transcript_fasta not provided and
skip_gtf_filter = true. - PR #1127 - Enlarge sampling to determine the number of columns in
filter_gtf.pyscript.
[3.13.1] - 2023-11-17
- PR #1121 - Changes for 3.13.1 patch release incl. igenomes star fix
[3.13.0] - 2023-11-17
Special thanks to the following for their contributions to the release:
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- PR #1049 - Display a warning when
--extra_star_align_argsare used with--aligner star_rsem - PR #1051 - Remove
public_aws_ecrprofile - PR #1054 - Template update to nf-core/tools v2.9
- PR #1058 - Use
nf-validationplugin for parameter and samplesheet validation - PR #1068 - Update
grepversion foruntarmodule - PR #1073 - Update documentation to discourage use of
--genome - PR #1078 - Updated pipeline template to nf-core/tools 2.10
- PR #1083 - Move local modules and subworkflows to subfolders
- PR #1088 - Updates contributing and code of conduct documents with nf-core template 2.10
- PR #1091 - Reorganise parameters in schema for better usability
- PR #1106 - Kallisto quantification
- PR #1107 - Expand GTF filtering to remove rows with empty transcript ID when required, fix STAR GTF usage
- #976 - Add author and licenses for all custom scripts
- #1050 - Provide custom prefix/suffix for summary files to avoid overwriting
- #1074 - Enable quantification using StringTie AND a custom
- #1082 - More informative error message for
filter_gtf_for_genes_in_genome.py - #1102 - gene entries with empty transcript_id fields
| Dependency | Old version | New version |
|---|---|---|
fastqc |
0.11.9 | 0.12.1 |
multiqc |
1.14 | 1.17 |
ucsc-bedgraphtobigwig |
377 | 445 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
| Script | Old name | New name |
|---|---|---|
local/gtf_filter |
GTF_GENE_FILTER |
GTF_FILTER |
local/tx2gene |
SALMON_TX2GENE |
TX2GENE |
local/tximport |
SALMON_TXIMPORT |
TXIMPORT |
local/quantify_salmon |
QUANTIFY_SALMON |
QUANTIFY_PSEUDO_ALIGNMENT |
nf-core/kallisto_index |
KALLISTO_INDEX |
|
nf-core/kallisto_quant |
KALLISTO_QUANT |
[3.12.0] - 2023-06-02
Special thanks to the following for their contributions to the release:
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- [#1011] - FastQ files from UMI-tools not being passed to fastp
- [#1018] - Ability to skip both alignment and pseudoalignment to only run pre-processing QC steps.
- PR #1016 - Updated pipeline template to nf-core/tools 2.8
- PR #1025 - Add
public_aws_ecr.configto source mulled containers when usingpublic.ecr.awsDocker Biocontainer registry - PR #1038 - Updated error log for count values when supplying
--additional_fasta - PR #1042 - revert samtools_sort modules to no memory assignement
| Old parameter | New parameter |
|---|---|
--skip_pseudo_alignment |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if new parameter information isn't present.
| Dependency | Old version | New version |
|---|---|---|
fastp |
0.23.2 | 0.23.4 |
samtools |
1.16.1 | 1.17 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.11.2] - 2023-04-25
Special thanks to the following for their contributions to the release:
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- [#1003] -
FASTQ_SUBSAMPLE_FQ_SALMON:SALMON_INDEXis launched multiple times and fails
[3.11.1] - 2023-03-31
Special thanks to the following for their code contributions to the release:
- [#987] - Fix issue with incorrect cacheing of test datasets during CI/CD
- [#988] -
DESEQ2_QC_STAR_SALMONfails when sample names have many components - Remove
wait: falseoption from Tower Actions which is the default - Fix release trigger for full-sized multi-cloud tests
- Adding
[ci fast]to commit message now skips all tests except for standard-profile testpipeline run
[3.11.0] - 2023-03-30
Special thanks to the following for their code contributions to the release:
Thank you to everyone else that has contributed by reporting bugs, enhancements or in any other way, shape or form.
- Add infrastructure and CI for multi-cloud full-sized tests run via Nextflow Tower (see #981)
- Added fastp support.
- Users can now select between
--trimmer trimgalore(default) and--trimmer fastp. - Trim Galore! specific pipeline parameters have been deprecated:
--clip_r1,--clip_r2,--three_prime_clip_r1,--three_prime_clip_r2and--trim_nextseq - Any additional options can now be specified via the
--extra_trimgalore_argsand--extra_fastp_argsparameters, respectively.
- Users can now select between
- [#663] - Alternative trimming step for polyA/T removal
- [#781] - Add Warning for poly(A) libraries
- [#878] - Allow tabs in fasta header when creating decoys for salmon index
- [#931] - Save transcriptome BAM files when using
--save_umi_intermeds/--save_align_intermeds - [#934] - Union of
ext.argsandparams.extra_star_align_argsprevents parameter clashes in the STAR module - [#940] - Bugfix in
salmon_summarizedexperiment.rto ensurerbinddoesn't fail whenrowdatahas notxcolumn. - [#944] - Read clipping using clip_r1, clip_r2, three_prime_clip_r1, three_prime_clip_r2 disabled in 3.10
- [#956] - Implement 'auto' as default strandedness argument in
fastq_dir_to_samplesheet.pyscript - [#960] - Failure with awsbatch when running processes that are using
executor: local - [#961] - Add warnings to STDOUT for all skipped and failed strandedness check samples
- [#975] -
SALMON_INDEXruns when using--aligner star_rsemeven if samples have explicit strandedness - Remove HISAT2 from automated AWS full-sized tests
| Old parameter | New parameter |
|---|---|
--trimmer |
|
--extra_trimgalore_args |
|
--clip_r1 |
|
--clip_r2 |
|
--three_prime_clip_r1 |
|
--three_prime_clip_r2 |
|
--tracedir |
|
--trim_nextseq |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if new parameter information isn't present.
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
fastp |
0.23.2 | |
multiqc |
1.13 | 1.14 |
picard |
2.27.4 | 3.0.0 |
salmon |
1.9.0 | 1.10.1 |
umi_tools |
1.1.2 | 1.1.4 |
NB: Dependency has been updated if both old and new version information is present.
NB: Dependency has been added if just the new version information is present.
NB: Dependency has been removed if new version information isn't present.
[3.10.1] - 2023-01-05
- [#919] - Salmon quant not run after FastQ subsampling if index not provided
- [#922] - Passing TrimGalore
--hardtrim3/--hardtrim5via custom config raises missing output filename error
[3.10] - 2022-12-21
- Bump minimum Nextflow version from
21.10.3->22.10.1 - Updated pipeline template to nf-core/tools 2.7.2
- [#729] - Add 'auto' option to samplesheet to automatically detect strandedness for samples
- [#889] - Document valid options for
--genomeparameter - [#891] - Skip MarkDuplicates when UMIs are used
- [#896] - Remove
copyTocall for iGenomes README - [#897] - Use
--skip_preseqby default - [#898] - Documentation on salmon decoy-aware index creation, gcbias and seqbias
- [#900] - Add
--recursiveoption tofastq_dir_to_samplesheet.pyscript - [#902] -
check_samplesheet.pyscript doesn't output optional columns in samplesheet - [#907] - Add
--extra_star_align_argsand--extra_salmon_quant_argsparameter - [#912] - Add UMI deduplication before quantification in tube map
| Old parameter | New parameter |
|---|---|
--enable_conda |
|
--extra_star_align_args |
|
--extra_salmon_quant_args |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if new parameter information isn't present.
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
bbmap |
38.93 | 39.01 |
bioconductor-dupradar |
1.18.0 | 1.28.0 |
bioconductor-summarizedexperiment |
1.20.0 | 1.24.0 |
bioconductor-tximeta |
1.8.0 | 1.12.0 |
fq |
0.9.1 | |
salmon |
1.5.2 | 1.9.0 |
samtools |
1.15.1 | 1.16.1 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.9] - 2022-09-30
- [#746] - Add
tin.pyoutput to MultiQC report - [#841] - Turn
--deseq2_vston by default - [#853] - Pipeline fails at email step: Failed to invoke
workflow.onCompleteevent handler - [#857] - Missing parameter required by StringTie if using STAR as aligner
- [#862] - Filter samples that have no reads after trimming
- [#864] - Pre-process transcripts fasta when using
--gencode - Expose additional arguments to UMI-tools as pipeline params:
--umitools_bc_pattern2is required if the UMI is located on read 2.--umitools_umi_separatorwill often be needed in conjunction with--skip_umi_extractas most other tools such as Illumina'sBCL Convertuse a colon instead of an underscore to separate the UMIs. The--umitools_grouping_methodallows to fine-tune handling of similar but non-identical UMIs. - Updated pipeline template to nf-core/tools 2.5.1
| Old parameter | New parameter |
|---|---|
--umitools_bc_pattern2 |
|
--umitools_umi_separator |
|
--umitools_grouping_method |
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
hisat2 |
2.2.0 | 2.2.1 |
multiqc |
1.11 | 1.13 |
picard |
2.26.10 | 2.27.4 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.8.1] - 2022-05-27
- [#834] -
nf-core downloadfails with version 3.8 of the pipeline
[3.8] - 2022-05-25
Fixed quite a well hidden bug in the UMI processing mode of the pipeline when using --with_umi --aligner star_salmon as reported by Lars Roed Ingerslev. Paired-end BAM files were not appropriately name sorted after umi_tools dedup which ultimately resulted in incorrect reading and quantification with Salmon. If you have used previous versions of the pipeline to analyse paired-end UMI data it will need to be reprocessed using this version of the pipeline. See #828 for more context.
- [#824] - Add explicit docs for usage of featureCounts in the pipeline
- [#825] - Pipeline fails due to trimming related removal of all reads from a sample
- [#827] - Control generation of --output-stats when running umi-tools dedup
- [#828] - Filter BAM output of UMI-tools dedup before passing to Salmon quant
- Updated pipeline template to nf-core/tools 2.4.1
| Old parameter | New parameter |
|---|---|
--min_trimmed_reads |
|
--umitools_dedup_stats |
[3.7] - 2022-05-03
- Updated default STAR version to latest available (
2.7.10a; see #808) - Vanilla Linux Docker container changed from
biocontainers/biocontainers:v1.2.0_cv1toubuntu:20.04to fix issues observed on GCP (see #764)
- [#762] - Explicitly set
--skip_bbsplit falsewith--bbsplit_fasta_listto use BBSplit - [#764] - Test fails when using GCP due to missing tools in the basic biocontainer
- [#765] - Add docs for the usage of nf-core/rnaseq with prokaryotic data
- [#775] - Incorrect columns in Salmon transcript files
- [#791] - Add outputs for umitools dedup summary stats
- [#797] - Add
--skip_umi_extractto account for pre-existing UMIs header embeddings. - [#798] - Decompress transcript fasta error
- [#799] - Issue with using
--retain_unpairedwith theFASTQC_UMITOOLS_TRIMGALORE:TRIMGALOREmodule - [#802] -
--bam_csi_indexerror generated if--skip_alignmentspecified - [#808] - Auto-detect usage of Illumina iGenomes reference
- [#809] - Add metro map for pipeline
- [#814] - Use decimal values for
--min_mapped_reads - Updated pipeline template to nf-core/tools 2.3.2
| Old parameter | New parameter |
|---|---|
--skip_umi_extract |
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
samtools |
1.14 | 1.15.1 |
star |
2.6.1d | 2.7.10a |
stringtie |
2.1.7 | 2.2.1 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.6] - 2022-03-04
- nf-core/tools#1415 - Make
--outdira mandatory parameter - [#734] - Is a vulnerable picard still used ? log4j vulnerability
- [#744] - Auto-detect and raise error if CSI is required for BAM indexing
- [#750] - Optionally ignore R1 / R2 after UMI extraction process
- [#752] - How to set publishing mode for all processes?
- [#753] - Add warning when user provides
--transcript_fasta - [#754] - DESeq2 QC issue linked to
--count_colparameter - [#755] - Rename RSEM_PREPAREREFERENCE_TRANSCRIPTS process
- [#759] - Empty lines in samplesheet.csv cause a crash
- [#769] - Do not run RSeQC tin.py by default
| Old parameter | New parameter |
|---|---|
--publish_dir_mode |
|
--umi_discard_read |
NB: Parameter has been updated if both old and new parameter information is present.
NB: Parameter has been added if just the new parameter information is present.
NB: Parameter has been removed if new parameter information isn't present.
[3.5] - 2021-12-17
- Port pipeline to the updated Nextflow DSL2 syntax adopted on nf-core/modules
- Removed
--publish_dir_modeas it is no longer required for the new syntax
- Removed
- Bump minimum Nextflow version from
21.04.0->21.10.3 - Updated pipeline template to nf-core/tools 2.2
- [#664] - Conflict of library names for technical replicates
- [#720] - KeyError 'gene_id' in salmon_tx2gene.py
- [#724] - Deal with warnings generated when native NF processes are used
- [#725] - Untar needs
--no-same-owneron DNAnexus - [#727] - Fix transcriptome staging issues on DNAnexus for rsem/prepareference
- [#728] - Add RSeQC TIN.py as a quality metric for the pipeline
[3.4] - 2021-10-05
- Software version(s) will now be reported for every module imported during a given pipeline execution
- Added
python3shebang to appropriate scripts inbin/directory - [#407] - Filter mouse reads from PDX samples
- [#570] - Update SortMeRNA to use SilvaDB 138 (for commercial use)
- [#690] - Error with post-trimmed read 2 sample names from FastQC in MultiQC
- [#693] - Cutadapt version missing from MultiQC report
- [#697] - pipeline_report.{txt,html} missing from pipeline_info directory
- [#705] - Sample sheet error check false positive
| Old parameter | New parameter |
|---|---|
--bbsplit_fasta_list |
|
--bbsplit_index |
|
--save_bbsplit_reads |
|
--skip_bbsplit |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if parameter information isn't present.
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
bbmap |
38.93 | |
hisat2 |
2.2.0 | 2.2.1 |
picard |
2.23.9 | 2.25.7 |
salmon |
1.4.0 | 1.5.2 |
samtools |
1.12 | 1.13 |
sortmerna |
4.2.0 | 4.3.4 |
trim-galore |
0.6.6 | 0.6.7 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.3] - 2021-07-29
- Updated pipeline template to nf-core/tools 2.1
- [#556] - Genome index is not recreated with --additional_fasta unless --star_index false
- [#668] - Salmon quant with UMI-tools does not work
- [#674] - Launch pipeline regex fails
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
samtools |
1.10 | 1.12 |
stringtie |
2.1.4 | 2.1.7 |
umi_tools |
1.1.1 | 1.1.2 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.2] - 2021-06-18
- Removed workflow to download data from public databases in favour of using nf-core/fetchngs
- Added a stand-alone Python script
bin/fastq_dir_to_samplesheet.pyto auto-create samplesheet from a directory of FastQ files - Added docs about overwriting default container definitions to use latest versions e.g. Pangolin
- [#637] - Add
--salmon_quant_libtypeparameter to provide the--libTypeoption to salmon quantification - [#645] - Remove trailing slash from
params.igenomes_base - [#649] - DESeq2 fails with only one sample
- [#652] - Results files have incorrect file names
- [nf-core/viralrecon#201] - Conditional include are not expected to work
| Old parameter | New parameter |
|---|---|
--public_data_ids |
|
--skip_sra_fastq_download |
|
--salmon_quant_libtype |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if parameter information isn't present.
[3.1] - 2021-05-13
- Samplesheet format has changed from
group,replicate,fastq_1,fastq_2,strandednesstosample,fastq_1,fastq_2,strandedness- This gives users the flexibility to name their samples however they wish (see #550)
- PCA generated by DESeq2 will now be monochrome and will not be grouped by using the replicate id
- Updated Nextflow version to
v21.04.0(see nextflow#572) - Restructure pipeline scripts into
modules/,subworkflows/andworkflows/directories
- Updated pipeline template to nf-core/tools
1.14 - Initial implementation of a standardised samplesheet JSON schema to use with user interfaces and for validation
- Only FastQ files that require to be concatenated will be passed to
CAT_FASTQprocess - [#449] -
--genomeSAindexNbaseswill now be auto-calculated before building STAR indices - [#460] - Auto-detect and bypass featureCounts execution if biotype doesn't exist in GTF
- [#544] - Update test-dataset for pipeline
- [#553] - Make tximport output files using all the samples; identified by @j-andrews7
- [#561] - Add gene symbols to merged output; identified by @grst
- [#563] - samplesheet.csv merge error
- [#567] - Update docs to mention trimgalore core usage nuances
- [#568] -
--star_indexargument is ignored with--aligner star_rsemoption - [#569] - nextflow edge release documentation for running 3.0
- [#575] - Remove duplicated salmon output files
- [#576] - umi_tools dedup : Run before salmon to dedup counts
- [#582] - Generate a separate bigwig tracks for each strand
- [#583] - Samtools error during run requires use of BAM CSI index
- [#585] - Clarify salmon uncertainty for some transcripts
- [#604] - Additional fasta with GENCODE annotation results in biotype error
- [#610] - save R objects as RDS
- [#619] - implicit declaration of the workflow in main
- [#629] - Add and fix EditorConfig linting in entire pipeline
- [nf-core/modules#423] - Replace
publish_by_idmodule option topublish_by_meta - [nextflow#2060] - Pipeline execution hang when native task fail to be submitted
| Old parameter | New parameter |
|---|---|
--hisat_build_memory |
--hisat2_build_memory |
--gtf_count_type |
--featurecounts_feature_type |
--gtf_group_features_type |
--featurecounts_group_type |
--bam_csi_index |
|
--schema_ignore_params |
|
--show_hidden_params |
|
--validate_params |
|
--clusterOptions |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if parameter information isn't present.
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
bedtools |
2.29.2 | 2.30.0 |
multiqc |
1.9 | 1.10.1 |
preseq |
2.0.3 | 3.1.2 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[3.0] - 2020-12-15
- You will need to install Nextflow
>=20.11.0-edgeto run the pipeline. If you are using Singularity, then features introduced in that release now enable the pipeline to directly download Singularity images hosted by Biocontainers as opposed to performing a conversion from Docker images (see #496). - The previous default of aligning BAM files using STAR and quantifying using featureCounts (
--aligner star) has been removed. The new default is to align with STAR and quantify using Salmon (--aligner star_salmon).- This decision was made primarily because of the limitations of featureCounts to appropriately quantify gene expression data. Please see Zhao et al., 2015 and Soneson et al., 2015).
- For similar reasons, quantification will not be performed if using
--aligner hisat2due to the lack of an appropriate option to calculate accurate expression estimates from HISAT2 derived genomic alignments.- This pipeline option is still available for those who have a preference for the alignment, QC and other types of downstream analysis compatible with the output of HISAT2. No gene-level quantification results will be generated.
- In a future release we hope to add back quantitation for HISAT2 using different tools.
- Updated pipeline template to nf-core/tools
1.12.1 - Bumped Nextflow version
20.07.1->20.11.0-edge - Added UCSC
bedClipmodule to restrict bedGraph file coordinates to chromosome boundaries - Check if Bioconda and conda-forge channels are set-up correctly when running with
-profile conda - Use
rsem-prepare-referenceand notgffreadto create transcriptome fasta file - [#494] - Issue running rnaseq v2.0 (DSL2) with test profile
- [#496] - Direct download of Singularity images via HTTPS
- [#498] - Significantly different versions of STAR in star_rsem (2.7.6a) and star (2.6.1d)
- [#499] - Use of salmon counts for DESeq2
- [#500, #509] - Error with AWS batch params
- [#511] - rsem/star index fails with large genome
- [#515] - Add decoy-aware indexing for salmon
- [#516] - Unexpected error [InvocationTargetException]
- [#525] - sra_ids_to_runinfo.py UnicodeEncodeError
- [#550] - handle samplesheets with replicate=0
| Old parameter | New parameter |
|---|---|
--fc_extra_attributes |
--gtf_extra_attributes |
--fc_group_features |
--gtf_group_features |
--fc_count_type |
--gtf_count_type |
--fc_group_features_type |
--gtf_group_features_type |
--singularity_pull_docker_container |
|
--skip_featurecounts |
NB: Parameter has been updated if both old and new parameter information is present. NB: Parameter has been added if just the new parameter information is present. NB: Parameter has been removed if parameter information isn't present.
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
bioconductor-summarizedexperiment |
1.18.1 | 1.20.0 |
bioconductor-tximeta |
1.6.3 | 1.8.0 |
picard |
2.23.8 | 2.23.9 |
requests |
2.24.0 | |
salmon |
1.3.0 | 1.4.0 |
ucsc-bedclip |
377 | |
umi_tools |
1.0.1 | 1.1.1 |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[2.0] - 2020-11-12
- Pipeline has been re-implemented in Nextflow DSL2
- All software containers are now exclusively obtained from Biocontainers
- Added a separate workflow to download FastQ files via SRA, ENA or GEO ids and to auto-create the input samplesheet (
ENA FTP; see--public_data_idsparameter) - Added and refined a Groovy
lib/of functions that include the automatic rendering of parameters defined in the JSON schema for the help and summary log information - Replace edgeR with DESeq2 for the generation of PCA and heatmaps (also included in the MultiQC report)
- Creation of bigWig coverage files using BEDTools and bedGraphToBigWig
- [#70] - Added new genome mapping and quantification route with RSEM via the
--aligner star_rsemparameter - [#72] - Samples skipped due to low alignment reported in the MultiQC report
- [#73, #435] - UMI barcode support
- [#91] - Ability to concatenate multiple runs of the same samples via the input samplesheet
- [#123] - The primary input for the pipeline has changed from
--readsglob to samplesheet--input. See usage docs. - [#197] - Samples failing strand-specificity checks reported in the MultiQC report
- [#227] - Removal of ribosomal RNA via SortMeRNA
- [#419] - Add
--additional_fastaparameter to provide ERCC spike-ins, transgenes such as GFP or CAR-T as additional sequences to align to
- Updated pipeline template to nf-core/tools
1.11 - Optimise MultiQC configuration for faster run-time on huge sample numbers
- Add information about SILVA licensing when removing rRNA to
usage.md - Fixed ansi colours for pipeline summary, added summary logs of alignment results
- [#281] - Add nag to cite the pipeline in summary
- [#302] - Fixed MDS plot axis labels
- [#338] - Add option for turning on/off STAR command line option (--sjdbGTFfile)
- [#344] - Added multi-core TrimGalore support
- [#351] - Fixes missing Qualimap parameter
-p - [#353] - Fixes an issue where MultiQC fails to run with
--skip_biotype_qcoption - [#357] - Fixes broken links
- [#362] - Fix error with gzipped annotation file
- [#384] - Changed SortMeRNA reference dbs path to use stable URLs (v4.2.0)
- [#396] - Deterministic mapping for STAR aligner
- [#412] - Fix Qualimap not being passed on correct strand-specificity parameter
- [#413] - Fix STAR unmapped reads not output
- [#434] - Fix typo reported for work-dir
- [#437] - FastQC uses correct number of threads now
- [#440] - Fixed issue where featureCounts process fails when setting
--fc_count_typeto gene - [#452] - Fix
--gffinput bug - [#345] - Fixes label name in FastQC process
- [#391] - Make publishDir mode configurable
- [#431] - Update AWS GitHub actions workflow with organization level secrets
- [#435] - Fix a bug where gzipped references were not extracted when
--additional_fastawas not specified - [#435] - Fix a bug where merging of RSEM output would fail if only one fastq provided as input
- [#435] - Correct RSEM output name (was saving counts but calling them TPMs; now saving both properly labelled)
- [#436] - Fix a bug where the RSEM reference could not be built
- [#458] - Fix
TMP_DIRfor process MarkDuplicates and Qualimap
| Old parameter | New parameter |
|---|---|
--reads |
--input |
--igenomesIgnore |
--igenomes_ignore |
--removeRiboRNA |
--remove_ribo_rna |
--rRNA_database_manifest |
--ribo_database_manifest |
--save_nonrRNA_reads |
--save_non_ribo_reads |
--saveAlignedIntermediates |
--save_align_intermeds |
--saveReference |
--save_reference |
--saveTrimmed |
--save_trimmed |
--saveUnaligned |
--save_unaligned |
--skipAlignment |
--skip_alignment |
--skipBiotypeQC |
--skip_biotype_qc |
--skipDupRadar |
--skip_dupradar |
--skipFastQC |
--skip_fastqc |
--skipMultiQC |
--skip_multiqc |
--skipPreseq |
--skip_preseq |
--skipQC |
--skip_qc |
--skipQualimap |
--skip_qualimap |
--skipRseQC |
--skip_rseqc |
--skipTrimming |
--skip_trimming |
--stringTieIgnoreGTF |
--stringtie_ignore_gtf |
--additional_fasta- FASTA file to concatenate to genome FASTA file e.g. containing spike-in sequences--deseq2_vst- Use vst transformation instead of rlog with DESeq2--enable_conda- Run this workflow with Conda. You can also use '-profile conda' instead of providing this parameter--min_mapped_reads- Minimum percentage of uniquely mapped reads below which samples are removed from further processing--multiqc_title- MultiQC report title. Printed as page header, used for filename if not otherwise specified--public_data_ids- File containing SRA/ENA/GEO identifiers one per line in order to download their associated FastQ files--publish_dir_mode- Method used to save pipeline results to output directory--rsem_index- Path to directory or tar.gz archive for pre-built RSEM index--rseqc_modules- Specify the RSeQC modules to run--save_merged_fastq- Save FastQ files after merging re-sequenced libraries in the results directory--save_umi_intermeds- If this option is specified, intermediate FastQ and BAM files produced by UMI-tools are also saved in the results directory--skip_bigwig- Skip bigWig file creation--skip_deseq2_qc- Skip DESeq2 PCA and heatmap plotting--skip_featurecounts- Skip featureCounts--skip_markduplicates- Skip picard MarkDuplicates step--skip_sra_fastq_download- Only download metadata for public data database ids and don't download the FastQ files--skip_stringtie- Skip StringTie--star_ignore_sjdbgtf- See #338--umitools_bc_pattern- The UMI barcode pattern to use e.g. 'NNNNNN' indicates that the first 6 nucleotides of the read are from the UMI--umitools_extract_method- UMI pattern to use. Can be either 'string' (default) or 'regex'--with_umi- Enable UMI-based read deduplication
--awsqueuecan now be provided via nf-core/configs if using AWS--awsregioncan now be provided via nf-core/configs if using AWS--compressedReferencenow auto-detected--markdup_java_optionsin favour of updating centrally on nf-core/modules--projectparameter from old NGI template--readPathsis not required since these are provided from the input samplesheet--sampleLevelnot required--singleEndis now auto-detected from the input samplesheet--skipEdgeRqc not performed by DESeq2 instead--star_memoryin favour of updating centrally on nf-core/modules if required- Strandedness is now specified at the sample-level via the input samplesheet
--forwardStranded--reverseStranded--unStranded--pico
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own Biocontainer. This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
| Dependency | Old version | New version |
|---|---|---|
bioconductor-dupradar |
1.14.0 | 1.18.0 |
bioconductor-summarizedexperiment |
1.14.0 | 1.18.1 |
bioconductor-tximeta |
1.2.2 | 1.6.3 |
fastqc |
0.11.8 | 0.11.9 |
gffread |
0.11.4 | 0.12.1 |
hisat2 |
2.1.0 | 2.2.0 |
multiqc |
1.7 | 1.9 |
picard |
2.21.1 | 2.23.8 |
qualimap |
2.2.2c | 2.2.2d |
r-base |
3.6.1 | 4.0.3 |
salmon |
0.14.2 | 1.3.0 |
samtools |
1.9 | 1.10 |
sortmerna |
2.1b | 4.2.0 |
stringtie |
2.0 | 2.1.4 |
subread |
1.6.4 | 2.0.1 |
trim-galore |
0.6.4 | 0.6.6 |
bedtools |
- | 2.29.2 |
bioconductor-biocparallel |
- | 1.22.0 |
bioconductor-complexheatmap |
- | 2.4.2 |
bioconductor-deseq2 |
- | 1.28.0 |
bioconductor-tximport |
- | 1.16.0 |
perl |
- | 5.26.2 |
python |
- | 3.8.3 |
r-ggplot2 |
- | 3.3.2 |
r-optparse |
- | 1.6.6 |
r-pheatmap |
- | 1.0.12 |
r-rcolorbrewer |
- | 1.1_2 |
rsem |
- | 1.3.3 |
ucsc-bedgraphtobigwig |
- | 377 |
umi_tools |
- | 1.0.1 |
bioconductor-edger |
- | - |
deeptools |
- | - |
matplotlib |
- | - |
r-data.table |
- | - |
r-gplots |
- | - |
r-markdown |
- | - |
NB: Dependency has been updated if both old and new version information is present. NB: Dependency has been added if just the new version information is present. NB: Dependency has been removed if version information isn't present.
[1.4.2] - 2019-10-18
- Minor version release for keeping Git History in sync
- No changes with respect to 1.4.1 on pipeline level
[1.4.1] - 2019-10-17
Major novel changes include:
- Update
igenomes.configwith NCBIGRCh38and most recent UCSC genomes - Set
autoMounts = trueby default forsingularityprofile
[1.4] - 2019-10-15
Major novel changes include:
- Support for Salmon as an alternative method to STAR and HISAT2
- Several improvements in
featureCountshandling of types other thanexon. It is possible now to handle nuclearRNAseq data. Nuclear RNA has un-spliced RNA, and the whole transcript, including the introns, needs to be counted, e.g. by specifying--fc_count_type transcript. - Support for outputting unaligned data to results folders.
- Added options to skip several steps
- Skip trimming using
--skipTrimming - Skip BiotypeQC using
--skipBiotypeQC - Skip Alignment using
--skipAlignmentto only use pseudoalignment using Salmon
- Skip trimming using
- Adjust wording of skipped samples in pipeline output
- Fixed link to guidelines #203
- Add
CitationandQuick Startsection toREADME.md - Add in documentation of the
--gffparameter
- Generate MultiQC plots in the results directory #200
- Get MultiQC to save plots as standalone files
- Get MultiQC to write out the software versions in a
.csvfile #185 - Use
fileinstead ofnew Fileto createpipeline_report.{html,txt}files, and properly create subfolders
- Restore
SummarizedExperimmentobject creation in the salmon_merge process avoiding increasing memory with sample size. - Fix sample names in feature counts and dupRadar to remove suffixes added in other processes
- Removed
genebody_coverageprocess #195 - Implemented Pearsons correlation instead of Euclidean distance #146
- Add
--stringTieIgnoreGTFparameter #206 - Removed unused
stringtiechannels forMultiQC - Integrate changes in
nf-core/tools v1.6template which resolved #90 - Moved process
convertGFFtoGTFbeforemakeSTARindex#215 - Change all boolean parameters from
snake_casetocamelCaseand vice versa for value parameters - Add SM ReadGroup info for QualiMap compatibility#238
- Obtain edgeR + dupRadar version information #198 and #112
- Add
--gencodeoption for compatibility of Salmon and featureCounts biotypes with GENCODE gene annotations - Added functionality to accept compressed reference data in the pipeline
- Check that gtf features are on chromosomes that exist in the genome fasta file #274
- Maintain all gff features upon gtf conversion (keeps
gene_biotypeorgene_typeto makefeatureCountshappy) - Add SortMeRNA as an optional step to allow rRNA removal #280
- Minimal adjustment of memory and CPU constraints for clusters with locked memory / CPU relation
- Cleaned up usage,
parameters.settings.jsonand thenextflow.config
- Dependency list is now sorted appropriately
- Force matplotlib=3.0.3
- Picard 2.20.0 -> 2.21.1
- bioconductor-dupradar 1.12.1 -> 1.14.0
- bioconductor-edger 3.24.3 -> 3.26.5
- gffread 0.9.12 -> 0.11.4
- trim-galore 0.6.1 -> 0.6.4
- gffread 0.9.12 -> 0.11.4
- rseqc 3.0.0 -> 3.0.1
- R-Base 3.5 -> 3.6.1
- Dropped CSVtk in favor of Unix's simple
cutandpasteutilities - Added Salmon 0.14.2
- Added TXIMeta 1.2.2
- Added SummarizedExperiment 1.14.0
- Added SortMeRNA 2.1b
- Add tximport and summarizedexperiment dependency #171
- Add Qualimap dependency #202
[1.3] - 2019-03-26
- Added configurable options to specify group attributes for featureCounts #144
- Added support for RSeqC 3.0 #148
- Added a
parameters.settings.jsonfile for use with the newnf-core launchhelper tool. - Centralized all configuration profiles using nf-core/configs
- Fixed all centralized configs for offline usage
- Hide %dup in multiqc report
- Add option for Trimming NextSeq data properly (@jburos work)
- Fixing HISAT2 Index Building for large reference genomes #153
- Fixing HISAT2 BAM sorting using more memory than available on the system
- Fixing MarkDuplicates memory consumption issues following #179
- Use
fileinstead ofnew Fileto create thepipeline_report.{html,txt}files to avoid creating local directories when outputting to AWS S3 folders - Fix SortMeRNA default rRNA db paths specified in assets/rrna-db-defaults.txt
- RSeQC 2.6.4 -> 3.0.0
- Picard 2.18.15 -> 2.20.0
- r-data.table 1.11.4 -> 1.12.2
- bioconductor-edger 3.24.1 -> 3.24.3
- r-markdown 0.8 -> 0.9
- csvtk 0.15.0 -> 0.17.0
- stringtie 1.3.4 -> 1.3.6
- subread 1.6.2 -> 1.6.4
- gffread 0.9.9 -> 0.9.12
- multiqc 1.6 -> 1.7
- deeptools 3.2.0 -> 3.2.1
- trim-galore 0.5.0 -> 0.6.1
- qualimap 2.2.2b
- matplotlib 3.0.3
- r-base 3.5.1
[1.2] - 2018-12-12
- Removed some outdated documentation about non-existent features
- Config refactoring and code cleaning
- Added a
--fcExtraAttributesoption to specify more than ENSEMBL gene names infeatureCounts - Remove legacy rseqc
strandRuleconfig code. #119 - Added STRINGTIE ballgown output to results folder #125
- HiSAT index build now requests
200GBmemory, enough to use the exons / splice junction option for building.- Added documentation about the
--hisatBuildMemoryoption.
- Added documentation about the
- BAM indices are stored and re-used between processes #71
- Fixed conda bug which caused problems with environment resolution due to changes in bioconda #113
- Fixed wrong gffread command line #117
- Added
cpus = 1toworkflow summary process#130
[1.1] - 2018-10-05
- Wrote docs and made minor tweaks to the
--skip_qcand associated options - Removed the depreciated
uppmax-modulesconfig profile - Updated the
hebbeconfig profile to use the newwithNamesyntax too - Use new
workflow.manifestvariables in the pipeline script - Updated minimum nextflow version to
0.32.0
- #77: Added back
executor = 'local'for theworkflow_summary_mqc - #95: Check if task.memory is false instead of null
- #97: Resolved edge-case where numeric sample IDs are parsed as numbers causing some samples to be incorrectly overwritten.
[1.0] - 2018-08-20
This release marks the point where the pipeline was moved from SciLifeLab/NGI-RNAseq over to the new nf-core community, at nf-core/rnaseq.
View the previous changelog at SciLifeLab/NGI-RNAseq/CHANGELOG.md
In addition to porting to the new nf-core community, the pipeline has had a number of major changes in this version. There have been 157 commits by 16 different contributors covering 70 different files in the pipeline: 7,357 additions and 8,236 deletions!
In summary, the main changes are:
- Rebranding and renaming throughout the pipeline to nf-core
- Updating many parts of the pipeline config and style to meet nf-core standards
- Support for GFF files in addition to GTF files
- Just use
--gffinstead of--gtfwhen specifying a file path
- Just use
- New command line options to skip various quality control steps
- More safety checks when launching a pipeline
- Several new sanity checks - for example, that the specified reference genome exists
- Improved performance with memory usage (especially STAR and Picard)
- New BigWig file outputs for plotting coverage across the genome
- Refactored gene body coverage calculation, now much faster and using much less memory
- Bugfixes in the MultiQC process to avoid edge cases where it wouldn't run
- MultiQC report now automatically attached to the email sent when the pipeline completes
- New testing method, with data on GitHub
- Now run pipeline with
-profile testinstead of using bash scripts
- Now run pipeline with
- Rewritten continuous integration tests with Travis CI
- New explicit support for Singularity containers
- Improved MultiQC support for DupRadar and featureCounts
- Now works for all users instead of just NGI Stockholm
- New configuration for use on AWS batch
- Updated config syntax to support latest versions of Nextflow
- Built-in support for a number of new local HPC systems
- CCGA, GIS, UCT HEX, updates to UPPMAX, CFC, BINAC, Hebbe, c3se
- Slightly improved documentation (more updates to come)
- Updated software packages
...and many more minor tweaks.
Thanks to everyone who has worked on this release!