Multi-mapping reads filtering support in countNonSplicedReads

In the documentation from FRASER countRNAData, there is the following parameters: 
```
...  Further parameters passed on to Rsubread::featureCounts.
```

However, when running with the parameters: 
```
reads_onerun <- countRNAData(
    sampleID ="sample1",
    fds = fds, 
    NcpuPerSample = 8,
    recount = TRUE,
    keepNonStandardChromosomes = TRUE,
    genome = c("BSgenome.Hsapiens.UCSC.hg38"),
    countMultiMappingReads = FALSE
)
```
an error is returned: 
```
Error in getNonSplitReadCountsForAllSamples(fds = fds, splitCountRanges = splitCountRanges,  : 
  unused arguments (sampleID = "sample1", countMultiMappingReads = TRUE)
```

In the nonspliced read counts function, the `countMultiMappingReads` parameter is hardcoded to `FALSE`. https://github.com/gagneurlab/FRASER/blob/ee6f27963edb439bea81a9e187a8d58570a22119/R/countRNAseqData.R#L904-L925

This is different than countSplicedReads, which uses GenomicAlignment functions for counting which can use ScanBamParams and tagFilters for filtering bam files. 
https://github.com/gagneurlab/FRASER/blob/ee6f27963edb439bea81a9e187a8d58570a22119/R/countRNAseqData.R#L534-L537
https://github.com/gagneurlab/FRASER/blob/ee6f27963edb439bea81a9e187a8d58570a22119/R/countRNAseqData.R#L567-L569
https://github.com/gagneurlab/FRASER/blob/ee6f27963edb439bea81a9e187a8d58570a22119/R/countRNAseqData.R#L613-L614

What is the reason that you cannot alter the multimapping count parameters for nonspliced reads, but can for spliced reads? Or am I misunderstanding how these functions work in tandem, and that the filtering for multimapping reads is done by extension when defining the spliced ranges resulting from countSplicedReads and needed as input for countNonSplicedReads? 

	rsubreadCounts <- featureCounts(files=bamFile, annot.ext=anno,
	minOverlap=minAnchor*2,
	allowMultiOverlap=TRUE,
	checkFragLength=FALSE,
	minMQS=bamMapqFilter(scanBamParam(fds)),
	strandSpecific=strand,

	# activating long read mode
	isLongRead=longRead,

	# multi-mapping reads
	countMultiMappingReads=TRUE,

	# unstranded case: for counting only non spliced reads we
	# skip this information
	isPairedEnd=isPairedEnd,

	# sorting only needed for paired-end reads
	autosort=doAutosort,
	nthreads=NcpuPerSample,
	tmpDir=file.path(file_path_as_absolute(workingDir(fds)), "cache")
	)

	countsList <- bplapply(chromosomes, FUN=countSplitReadsPerChromosome,
	bamFile=bamfile, pairedEnd=pairedend, genome=genome,
	strandMode=strandmode, scanBamParam=scanbamparam,
	BPPARAM=getBPParam(NcpuPerSample, length(chromosomes)))

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-mapping reads filtering support in countNonSplicedReads #91

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	galignment <- readGAlignmentPairs(
	bamFile, param=param, strandMode=strandMode)
	}

	jc <- summarizeJunctions(galignment, genome=genome,
	with.revmap=(as.logical(strandMode) && pairedEnd) )

Multi-mapping reads filtering support in countNonSplicedReads #91

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions