Skip to content

Recommended workflow for generating simplitigs from bacterial pangenomes #11

@SionBayliss

Description

@SionBayliss

I am a little unclear on the recommended workflow for generating simplitigs from a bacterial pangenome.

As pangenomes aim to encompass all potential variants providing multiple input files via "-i" is probably unsuitable as the intersection would only include core K-mers (in all isolates). Is this correct? If that is the case, should the workflow be:

a) Concatenate all files (reads/assemblies separately) into a single fasta file and then process the concatenate with prophasm (might incur high memory overhead)

or

b) Compute simplitigs for each file and then concatenate the simplitig files together before running prophasm again? (lower memory usage)

Are there any other methodological considerations for these approaches that I have overlooked?

Thank you for your help,
Sion

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions