Overview of work done during time with Sally Aitken and Sam on the CoAdapTree project. More detailed descriptions accompany the linked repositories' READMEs.
✉️ lind.brandon.m@gmail.com
🐦 @iowensis
🎓 google scholar
🔬 research page
🔗 LinkedIn
If you use or are inspired by code in this repository please cite the following work or contact me about how to cite.
-
CoAdapTree pool-seq VarScan pipeline
- autonomous SNP calling pipeline for pool-seq data with VarScan
Lind B (2021) GitHub.com/CoAdapTree/varscan_pipeline: Publication release (Version 1.0.0). Zenodo. http://doi.org/10.5281/zenodo.5083302 -
- This code takes output from the VarScan pipeline and calculates Cochran-Mantel-Haenszel chi-squared tests on stratified contingency tables of read counts standardized by population ploidy.
Lind B (2021) GitHub.com/brandonlind/cmh_test: preprint release (Version 1.0.0). Zenodo. http://doi.org/10.5281/zenodo.5083798 -
Douglas-fir GEA populations (in progress)
- This repository contains the processessing scripts needed to analyze range-wide populations of Douglas-fir.
- Lind BM. 2023. GitHub.com/brandonlind/douglas_fir_natural_populations: Offset Revision 1 (v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.8018894
-
Jack pine GEA populations (in progress)
- This repository contains the processessing scripts needed to analyze range-wide populations of jack pine.
- Lind BM. 2023. GitHub.com/brandonlind/jack_pine_natural_populations: Offset Revision 1 (v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.8018892
-
CoAdapTree Douglas-fir / lodgepole pine SNP array (in progress)
- This repository covers the code used to develop a 50k SNP array split evenly between Douglas-fir and lodgepole pine.
-
(2021) BM Lind* , M Lu*, D Vidakovic, P Singh, T Booker, S Yeaman, S Aitken. Haploid, diploid, and pooled exome capture recapitulate features of biology and paralogy in two non-model tree species. Molecular Ecology Resources. Direct Link or Download
- Abstract Despite their suitability for studying evolution, many conifer species have large and repetitive giga-genomes (16-31Gbp) that create hurdles to producing high coverage SNP datasets that captures diversity from across the entirety of the genome. Due in part to multiple ancient whole genome duplication events, gene family expansion and subsequent evolution within Pinaceae, false diversity from the misalignment of paralog copies create further challenges in accurately and reproducibly inferring evolutionary history from sequence data. Here, we leverage the cost-saving benefits of pool-seq and exome-capture to discover SNPs in two conifer species, Douglas-fir (Pseudotsuga menziesii var. menziesii (Mirb.) Franco, Pinaceae) and jack pine (Pinus banksiana Lamb., Pinaceae). We show, using minimal baseline filtering, that allele frequencies estimated from pooled individuals show a strong positive correlation with those estimated by sequencing the same population as individuals (r > 0.948), on par with such comparisons made in model organisms. Further, we highlight the use of haploid megagametophyte tissue in identifying sites that are likely due to misaligned paralogs. Together with additional minor filtering, we show that it is possible to remove many of the loci with large frequency estimate discrepancies between individual and pooled sequencing approaches, improving the correlation further (r > 0.973). Our work addresses bioinformatic challenges in non-model organisms with large and complex genomes, highlights the use of megagametophyte tissue for the identification of paralog sites when sequencing large numbers of populations, and suggests the combination of pool-seq and exome capture to be robust for further evolutionary hypothesis testing in these systems.
- Code for manuscript
- Data for manuscript
-
(2022) P Singh, B St Clair, BM Lind , R Cronn, NP Wilhelmi, M Lu, D Vidakovic, R Hamelin, D Shaw, S Aitken, S Yeaman Genomic architecture of resistance and tolerance to Swiss needle cast and Rhabdocline needle cast diseases in Douglas-fir. EcoEvoRxiv https://doi.org/10.32942/osf.io/s5rd7
- Abstract Understanding the genetic architecture of tolerance and resistance to pathogens is important to monitor and maintain resilient tree populations. Here we investigate the genetic basis of tolerance and resistance and to needle cast disease in Douglas-fir (Pseudotsuga menziesii) caused by two fungal pathogens: Swiss needle cast (SNC) caused by Nothophaeocryptopus gaeumannii, and Rhabdocline needle cast (RNC) caused by Rhabdocline pseudotsugae). We performed a case-control genome-wide association analysis (GWAS) and found these traits to be polygenic. Significant associations with SNC resistance were found for SNPs in genes for stomatal regulation and ethylene and jasmonic acid pathways, which are known for their roles in plant defense and immunity. Top-associated SNPs for SNC tolerance were found in genes of secondary metabolite pathways. We identified a key upstream transcription factor of plant defence, ERF1, as the main candidate for RNC resistance. Our findings contribute to the understanding of the highly polygenic architectures underlying disease resistance and tolerance in Douglas-fir and have important implications for forestry and conservation as the climate changes.
- Code for Manuscript
-
(2023) BM Lind , R Candido-Ribeiro, P Singh, M Lu, DO Vidakovic, TR Booker, M Whitlock, N Isabel, S Yeaman, and SN Aitken.How useful is genomic data for predicting maladaptation to future climate? biorxiv, doi: 10.1101/2023.02.10.528022.
- Abstract Methods using genomic information to forecast potential population maladaptation to climate change are becoming increasingly common, yet the lack of model validation poses serious hurdles toward their incorporation into management and policy. Here, we compare the validation of two methods – Gradient Forests (GF) and the Risk Of Non-Adaptedness – using exome capture pool-seq data from 35 to 39 populations across three conifer taxa: two Douglas-fir varieties and jack pine. We evaluate sensitivity of these algorithms to the source of input loci (markers selected from genotype-environment associations [GEA] or those selected at random). We validate these methods against two-year and 52-year growth and mortality measured in independent transplant experiments. Overall, we find that both methods often better predict transplant performance than climatic or geographic distances. We also find that while GF and RONA models are surprisingly not improved using GEA candidates. Even with promising validation results, ambiguity of model projections to future climates makes it difficult to identify the most maladapted populations using either method. Our work advances understanding of the sensitivity and applicability of these approaches, and we discuss recommendations for their future use.
- Code for manuscript