V1.0.0 by guanqiaofeng · Pull Request #1 · icgc-argo-workflows/rnaaln

guanqiaofeng · 2024-09-10T17:53:27Z

RNA Seq Alignment Workflow Version 1.0.0
Please refer to README for test details

edsu7 · 2024-09-20T19:19:50Z

+
+## Pipeline tools
+
+- [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)


Unneeded; remove these or replace them with the relevant tools.

guanqiaofeng · 2024-09-23T18:23:51Z

+## Pipeline tools
+
+- [GffRead](https://pubmed.ncbi.nlm.nih.gov/32489650/)
+
+  > Pertea G, Pertea M. GFF Utilities: GffRead and GffCompare. F1000Res. 2020 Apr 28;9:ISCB Comm J-304. doi: 10.12688/f1000research.23297.2. eCollection 2020. PubMed PMID: 32489650; PubMed Central PMCID: PMC7222033.
+
+- [HISAT2](https://pubmed.ncbi.nlm.nih.gov/31375807/)
+


updated pipeline tools

edsu7 · 2024-09-24T20:44:48Z

+        samtools: \$(echo \$(samtools --version 2>&1) | sed 's/^.*samtools //; s/Using.*\$//')
+    END_VERSIONS
+    """
+}


Missing Stub for preview
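
A minimal sketch of what such a stub could look like, assuming a process that writes a BAM and a `versions.yml`. The process name, `meta.id`, the `samtools sort` command, and the output file name are hypothetical placeholders rather than the module under review; only the samtools version line is taken from the diff above.

```nextflow
process EXAMPLE_SAMTOOLS_STEP {   // hypothetical name, not the module in this PR
    input:
    tuple val(meta), path(bam)

    output:
    tuple val(meta), path("*.sorted.bam"), emit: bam
    path "versions.yml"                  , emit: versions

    script:
    def prefix = task.ext.prefix ?: "${meta.id}"   // meta.id is an assumption
    """
    samtools sort -o ${prefix}.sorted.bam ${bam}

    cat <<-END_VERSIONS > versions.yml
    "${task.process}":
        samtools: \$(echo \$(samtools --version 2>&1) | sed 's/^.*samtools //; s/Using.*\$//')
    END_VERSIONS
    """

    stub:
    def prefix = task.ext.prefix ?: "${meta.id}"
    // Create empty placeholders for every declared output so downstream
    // processes can be wired up without running samtools.
    """
    touch ${prefix}.sorted.bam
    touch versions.yml
    """
}
```

With a stub like this in each process, `nextflow run main.nf -stub` should walk the whole workflow graph using the placeholder outputs only.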

edsu7 · 2024-09-24T20:48:32Z

+    output:
+    tuple val(meta), path("*.hisat2_Aligned.bam")    , emit: bam
+    tuple val(meta), path("*_summary.txt")           , emit: summary
+    tuple val(meta), path("*fastq.gz"), optional:true, emit: fastq


Is this for unmapped reads?

I'm not sure we're producing unmapped reads:
https://daehwankimlab.github.io/hisat2/manual/#:~:text=in%20the%20input.-,%2D%2Dun%2Dconc,-%3Cpath%3E%2C
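
For reference, a hedged sketch of how the optional `fastq` output would only be populated if the command passes one of HISAT2's `--un*` options. It assumes paired-end input and a hypothetical `params.save_unaligned` switch; the process name, `meta.id`, and the unmapped-read file names are illustrative and not taken from this PR.

```nextflow
process HISAT2_ALIGN_SKETCH {   // hypothetical name; paired-end reads assumed
    input:
    tuple val(meta), path(reads)   // reads[0] = R1, reads[1] = R2
    path index                     // directory holding the *.ht2 index files

    output:
    tuple val(meta), path("*.hisat2_Aligned.bam")    , emit: bam
    tuple val(meta), path("*_summary.txt")           , emit: summary
    tuple val(meta), path("*fastq.gz"), optional:true, emit: fastq

    script:
    def prefix    = task.ext.prefix ?: "${meta.id}"
    // Only ask HISAT2 for unaligned pairs when the (hypothetical) switch is set.
    def unaligned = params.save_unaligned ? "--un-conc-gz ${prefix}.unmapped.fastq.gz" : ''
    """
    INDEX=\$(find -L ./ -name "*.1.ht2" | sed 's/\\.1\\.ht2\$//')

    hisat2 \\
        -x \$INDEX \\
        -1 ${reads[0]} \\
        -2 ${reads[1]} \\
        --summary-file ${prefix}_summary.txt \\
        --threads ${task.cpus} \\
        ${unaligned} \\
        | samtools view -b -o ${prefix}.hisat2_Aligned.bam -

    # HISAT2 adds .1/.2 to the --un-conc-gz path; rename so "*fastq.gz" matches.
    if [ -f ${prefix}.unmapped.fastq.1.gz ]; then
        mv ${prefix}.unmapped.fastq.1.gz ${prefix}.unmapped_1.fastq.gz
    fi
    if [ -f ${prefix}.unmapped.fastq.2.gz ]; then
        mv ${prefix}.unmapped.fastq.2.gz ${prefix}.unmapped_2.fastq.gz
    fi
    """
}
```

If no `--un*` option is ever passed, the `fastq` channel stays empty and the output declaration could probably be dropped.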

edsu7 · 2024-09-24T20:57:17Z

+                    genome_annotation: "${params.genome_annotation}",
+                    read_groups_count: "${meta.numLanes}",
+                    study_id : "${meta.study_id}",
+                    date :"${new Date().format("yyyyMMdd")}",


Is the date defined twice? Ideally the date should be set once, prior to payload generation. If it is used before then, we run the risk of a workflow terminating and duplicate work being generated because of a new date value.
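
One way this could be arranged, as a sketch only: compute the date a single time and carry it in `meta`, with an optional override so a resumed run can reuse the original value. The parameter name `analysis_date` and the sample `meta` map are hypothetical and not part of this PR.

```nextflow
// Hypothetical parameter: pass --analysis_date 20240924 to pin the value on resume.
params.analysis_date = null

workflow {
    // Resolve the date once, before any payload is built.
    def run_date = params.analysis_date ?: new Date().format('yyyyMMdd')

    Channel.of([id: 'sample1', study_id: 'TEST-STUDY'])   // placeholder meta map
        .map { meta -> meta + [date: run_date] }          // every consumer reads meta.date
        .view()
}
```

Payload steps would then read `meta.date` instead of calling `new Date()` again, so the value should not change between an interrupted run and its resume as long as the same override is supplied.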

edsu7 · 2024-09-27T17:39:13Z

+        .set{ch_h_aln_payload}
+
+        // Make ALN payload
+        PAYLOAD_ALIGNMENT_H(  // [val (meta), [path(cram),path(crai)],path(analysis_json)]


Minor nitpick about the comment: it should be inline with the variable it describes.
e.g.

PAYLOAD_ALIGNMENT_H(
    ch_h_aln_payload.upload, // [val (meta), [path(cram),path(crai)],path(analysis_json)]
    Channel.empty()
        .mix(STAGE_INPUT.out.versions)
        .mix(HISAT2_ALIGN.out.versions)
        .mix(MERG_DUP_H.out.versions)
        .collectFile(name: 'collated_versions.yml')
)

edsu7 · 2024-09-27T17:39:40Z

+                experiment:"${meta.experiment}",
+                date:"${meta.date}",
+                read_group:"${info.read_group.collect()}",
+                data_type:"${info.data_type.collect()}",  // later check whether data type is correct **


Leftover note?

edsu7 · 2024-09-27T17:41:05Z

+    if (params.tools.split(',').contains('hisat2_aln')){
+
+        // HISAT2 - ALIGN //
+        index = Channel.fromPath(params.hisat2_index).collect()


I don't recall: was it decided to add an indexing step to the workflow?

edsu7 · 2024-09-27T17:42:23Z

+        ch_multiqc = Channel.empty()
+        ch_multiqc = ch_multiqc.mix(ch_reports.collect{meta, report -> report}).ifEmpty([])
+
+        ch_multiqc_config = Channel.fromPath("$projectDir/assets/multiqc_config.yml", checkIfExists: true)


When defining variables, it is better to declare all files at the start of the workflow, for easier management and readability.
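
A rough sketch of the suggested layout, reusing the channel names from the diff hunks in this review. The workflow name is a placeholder and the process calls are elided.

```nextflow
workflow RNAALN_SKETCH {   // placeholder name
    // File inputs declared together at the top of the workflow.
    ch_multiqc_config = Channel.fromPath("$projectDir/assets/multiqc_config.yml", checkIfExists: true)
    index             = Channel.fromPath(params.hisat2_index).collect()

    // Accumulator channels next.
    ch_multiqc = Channel.empty()

    // Process calls follow and only reference the channels declared above.
}
```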

edsu7 · 2024-09-27T17:56:52Z

+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+*/
+
+params.study_id                    = WorkflowMain.getGenomeAttribute(params, 'study_id')


Redefining these is not needed.

See:
https://github.com/icgc-argo-workflows/prealnqc/blob/main/main.nf

edsu7

See above comments

guanqiaofeng and others added 4 commits September 10, 2024 12:13

version 1, readme need to be updated

116e95c

Update README.md

f5309ea

update config files

e382120

Update README.md

426d73a

guanqiaofeng self-assigned this Sep 10, 2024

guanqiaofeng requested review from edsu7 and lindaxiang September 10, 2024 17:53

edsu7 reviewed Sep 20, 2024

guanqiaofeng added 2 commits September 23, 2024 14:19

updated CITATIONS.md

32e1be8

Merge branch 'v1.0.0' of github.com:icgc-argo-workflows/rnaaln into v…

b98e01a

…1.0.0

guanqiaofeng commented Sep 23, 2024

edsu7 reviewed Sep 24, 2024

edsu7 reviewed Sep 27, 2024

edsu7 suggested changes Sep 27, 2024

lindaxiang approved these changes Oct 14, 2025

guanqiaofeng wants to merge 6 commits into main from v1.0.0
