Skip to content

DMRs have multiple entries, methylation rate missing values #5

@mkpython3

Description

@mkpython3

Hi,

I used Methylscore with default parameters to call DMRs between 23 samples separately for each context. I have noticed that some DMRs show up multiple times with one of their "methylation rate" columns (Nr. 6+) missing a value.
This might be a side effect of me changing the pipeline from using tbi index to csi.
The change was necessary to handle larger genomes which are not supported by tbi, however only a small proportion of the DMRs shows this weird behavior, therefore I am unsure if it is actually caused by the different index.

Here is an example:

chr1H	83540	84016	477	-3-2-.3-.21--223333-212	1:	2:28	3:64	1:HvBS013,HvBS024	2:HvBS005,HvBS012,HvBS016,HvBS017,HvBS023,HvBS048	3:HvBS002,HvBS008,HvBS018,HvBS019,HvBS020,HvBS021	#:26	CG	-
chr1H	83540	84016	477	-3-2-.3-.21--223333-212	1:89	2:	3:64	1:HvBS013,HvBS024	2:HvBS005,HvBS012,HvBS016,HvBS017,HvBS023,HvBS048	3:HvBS002,HvBS008,HvBS018,HvBS019,HvBS020,HvBS021	#:26	CG	-
chr1H	83540	84016	477	-3-2-.3-.21--223333-212	1:89	2:28	3:	1:HvBS013,HvBS024	2:HvBS005,HvBS012,HvBS016,HvBS017,HvBS023,HvBS048	3:HvBS002,HvBS008,HvBS018,HvBS019,HvBS020,HvBS021	#:26	CG	CG

The information in these lines is identical except for the methylation rate columns and the last column.

Do you have an idea of the exact cause or could you help me create a patch to use csi index correctly as it is necessary to handle sequences longer than 512 Mbp.

Thanks in advance,
Marius

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions