Skip to content

merging of transcript ids and gene ids is frustrating to use #284

@diekhans

Description

@diekhans

The convention of merging the transcript id and gene id into transId_geneId is time-wasting to use in analysis pipelines.

When the GTF is written out, the transcript_id is set to just the trans_id part. So then if you run other tools on the GTF,
and want to access the flair files, you need to do id mapping from transId_geneId.

Plus using "_" conflicts with RefSeq accessions 🤯

Please don't encoding of metadata in ids, this has long been known to cause all kinds of grief in bioinformatics (TCGA). Please use two columns, one for transcript id and one for gene id in files. for BEDs, you can make it a BED12+1 if you want (nice for turning into a bigBed track.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions