Home
PATO is an R package designed to analyze pangenomes (sets of genomes) within or between species. It supports analysis of the core genome, the accessory genome and the whole genome, as well as population structure and horizontal gene transfer dynamics. PATO uses MASH, MMseqs2, Minimap2 and R as its core software.
These tools can handle thousands of genomes on a conventional computer,
without the need for an HPC facility. PATO can analyze data
in mash distance format (whole genome) or accnet format (accessory genome).
Most functions can handle both objects; some functions are specific
to one kind of data. The primitive object mmseq is an orthologous clustering that
is used to build the accnet object, annotate the genomes and characterize
the size (and dynamics) of the core, accessory and pan-genome.
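
To make the relationship between an orthologous clustering and the core, accessory and pan-genome more concrete, here is a minimal conceptual sketch in Python. It is not PATO's API or implementation. The genome names, cluster IDs and function names are hypothetical. It also assumes a strict core definition (a cluster present in every genome), which may differ from the criterion PATO applies.

```python
# Hypothetical toy clustering: each genome maps to the set of
# orthologous cluster IDs it contains (illustrative data only).
clusters_by_genome = {
    "genome_A": {"c1", "c2", "c3"},
    "genome_B": {"c1", "c2", "c4"},
    "genome_C": {"c1", "c3", "c5"},
}


def pangenome_partition(clusters_by_genome):
    """Split clusters into pan, core and accessory sets.

    Assumption: "core" means present in every genome (strict core).
    """
    gene_sets = list(clusters_by_genome.values())
    pan = set().union(*gene_sets)        # clusters seen in any genome
    core = set.intersection(*gene_sets)  # clusters seen in all genomes
    accessory = pan - core               # clusters seen in only some genomes
    return pan, core, accessory


def presence_absence(clusters_by_genome, clusters):
    """Build a binary genome x cluster table for the given clusters."""
    columns = sorted(clusters)
    return {
        genome: [int(cluster in members) for cluster in columns]
        for genome, members in clusters_by_genome.items()
    }


pan, core, accessory = pangenome_partition(clusters_by_genome)
matrix = presence_absence(clusters_by_genome, accessory)
print(len(pan), len(core), len(accessory))
```

A presence/absence table restricted to accessory clusters, such as `matrix` here, is the kind of genome-by-cluster data an accessory-genome representation can be derived from.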