Skip to content

Duplicate ontology terms #4

@Jon-bioinfo

Description

@Jon-bioinfo

I've come across these ontology terms which have the same samples:

"DOID:285 ! hairy cell leukemia": [ "lymphoma, malignant, hairy B-cell cell line:MLMA.CNhs11935.10775-110G1", "hairy cell leukemia cell line:Mo.CNhs11843.10712-109I1" ],
and
"DOID:1040 ! chronic lymphocytic leukemia": [ "lymphoma, malignant, hairy B-cell cell line:MLMA.CNhs11935.10775-110G1", "hairy cell leukemia cell line:Mo.CNhs11843.10712-109I1" ],

This seems to mean that the avg annotation column in the results is being given both terms separated by a comma e.g.
ACTGAGTAGATAGCAT-1,"DOID:1040 ! chronic lymphocytic leukemia, DOID:285 ! hairy cell leukemia",0.342386323

This is a bit annoying to parse when the ontology names themselves are often comma separated e.g. CL:0002397 ! CD14-positive, CD16-positive monocyte

I've not checked if there are other cases like this of duplicate ontology terms. Is this intended?

Best,

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions