Skip to content

Output related questions #109

@JensUweUlrich

Description

@JensUweUlrich

Hi,

I have some questions regarding the output of chopper layout. I tried to calculate the layout for the viral refseq and got the following header lines as part of the output

#HIGH_LEVEL_IBF max_bin_id:173
#MERGED_BIN_0 max_bin_id:153
#MERGED_BIN_1 max_bin_id:196
#MERGED_BIN_2 max_bin_id:117
#MERGED_BIN_3 max_bin_id:253
#MERGED_BIN_4 max_bin_id:127
#MERGED_BIN_5 max_bin_id:167
.
.
.
#MERGED_BIN_447 max_bin_id:34
#FILES  BIN_INDICES     NUMBER_OF_BINS
files.renamed/GCF_002826665.1_genomic.fna.gz    0;0     1;1
files.renamed/GCF_002219365.1_genomic.fna.gz    0;1     1;1
files.renamed/GCF_003847265.1_genomic.fna.gz    0;2     1;1
files.renamed/GCF_002826065.1_genomic.fna.gz    0;3     1;2
files.renamed/GCF_000915375.1_genomic.fna.gz    0;5     1;1
.
.
.
iles.renamed/GCF_001995575.1_genomic.fna.gz    432     1
files.renamed/GCF_001041755.1_genomic.fna.gz    433;0   1;35
files.renamed/GCF_001502095.1_genomic.fna.gz    433;35  1;29
files.renamed/GCF_000903335.1_genomic.fna.gz    434     1
files.renamed/GCF_002116175.1_genomic.fna.gz    435;0   1;35
files.renamed/GCF_016811445.1_genomic.fna.gz    435;35  1;29
files.renamed/GCF_001308775.1_genomic.fna.gz    436     1
files.renamed/GCF_001041035.1_genomic.fna.gz    437     1
files.renamed/GCF_000865825.1_genomic.fna.gz    438     1
files.renamed/GCF_002826725.1_genomic.fna.gz    439;0   1;40
files.renamed/GCF_000839765.1_genomic.fna.gz    439;40  1;24
files.renamed/GCF_001602085.1_genomic.fna.gz    440     1
files.renamed/GCF_002628245.1_genomic.fna.gz    441     1
files.renamed/GCF_000887095.1_genomic.fna.gz    442     1
files.renamed/GCF_000924835.1_genomic.fna.gz    443     1
files.renamed/GCF_000922335.1_genomic.fna.gz    444     1
files.renamed/GCF_001654305.1_genomic.fna.gz    445     1
files.renamed/GCF_000848085.2_genomic.fna.gz    446;0   1;32
files.renamed/GCF_000923135.1_genomic.fna.gz    446;32  1;32
files.renamed/GCF_001316375.1_genomic.fna.gz    447;0   1;34
files.renamed/GCF_000875305.1_genomic.fna.gz    447;34  1;30
files.renamed/GCF_000893455.1_genomic.fna.gz    448     1

As far as I can see, these are all merged bins, but what does max_bin_id refer to? And how can I infer the topology of the hierarchy from the output?
How can I interpret the BIN_INDICES and NUMBER_OF_BINS columns?

Cheers
Jens

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions