Skip to content

Multi-nucleotide variants not split correctly – potential normalization issue? #69

@longfei-stat

Description

@longfei-stat

Hi team,

Thanks for developing and maintaining this great tool.

I’ve noticed that some nearby variants are not being split into individual SNVs as expected. For example:

15850TA>CG
This appears to be a multi-nucleotide variant (MNV), but I would expect it to be normalized into two SNVs:

  • 15850T>C

  • 15851A>G

185_GCGAA_ACGAG
This looks like it should correspond to two SNVs:

  • 185G>A

  • 189A>G

This behavior makes downstream analysis and annotation more difficult. I suspect this might be related to how variant normalization is being handled in the pipeline.

Could you clarify whether these cases are expected behavior? And if not, is there a way to enable proper decomposition of MNVs into SNVs?

Thanks again for your support!

Kind regards,
Fei

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions