Val F. Lanza edited this page Jan 28, 2021 · 18 revisions

Pangenome Analysis Toolkit (PATO)

PATO is an R package designed to analyze pangenomes (sets of genomes) within or across species. It supports analysis of the core genome, the accessory genome and the whole genome, as well as population structure and horizontal gene transfer dynamics. PATO uses MASH, MMseqs2, Minimap2 and R as its core software.

These tools can handle thousands of genomes on conventional computers, without the need for an HPC facility. PATO can analyze data in mash distance format (whole genome) or accnet format (accessory genome). Most functions can handle both objects; some are specific to one kind of data. The primitive object mmseq is an orthologous clustering that is used to build the accnet object, annotate the genomes, and characterize the size (and dynamics) of the core-, accessory- and pan-genome.
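
To make the relationship between an orthologous clustering and the core-, accessory- and pan-genome concrete, here is a minimal conceptual sketch in Python. It is not PATO's API and does not call any PATO function: the genome names, cluster ids and helper functions are hypothetical, the core genome is assumed to be the clusters present in every genome (a strict definition; real analyses often use a looser threshold), and the accessory genome is assumed to be representable as genome-to-cluster links.

```python
# Toy illustration only: this is NOT PATO's API.
# Hypothetical clustering result: each genome maps to the set of
# ortholog cluster ids found in it.
clusters_by_genome = {
    "genomeA": {"c1", "c2", "c3"},
    "genomeB": {"c1", "c2", "c4"},
    "genomeC": {"c1", "c3", "c5"},
}


def summarize_pangenome(clusters_by_genome):
    """Split clusters into pan-, core- and accessory-genome sets.

    Assumes at least one genome, and a strict core definition
    (cluster present in every genome).
    """
    genomes = list(clusters_by_genome.values())
    pan = set().union(*genomes)           # clusters seen in any genome
    core = set.intersection(*genomes)     # clusters seen in all genomes
    accessory = pan - core                # everything that is not core
    return {"pan": pan, "core": core, "accessory": accessory}


def accessory_edges(clusters_by_genome, accessory):
    """List (genome, cluster) links restricted to accessory clusters."""
    return sorted(
        (genome, cluster)
        for genome, clusters in clusters_by_genome.items()
        for cluster in clusters
        if cluster in accessory
    )


summary = summarize_pangenome(clusters_by_genome)
edges = accessory_edges(clusters_by_genome, summary["accessory"])

print({name: len(members) for name, members in summary.items()})
print(edges)
```

In this toy data set only `c1` is shared by all three genomes, so it alone forms the core; the remaining clusters are accessory, and the edge list is the kind of genome-to-cluster table an accessory-genome network could be built from.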
