Annie S. Guillaume, Kevin Leempoel, Aude Rogivue, Felix Gugerli, Christian Parisod, Stéphane Joost
{"title":"在基因型-环境关联研究中整合高分辨率环境代用指标。","authors":"Annie S. Guillaume, Kevin Leempoel, Aude Rogivue, Felix Gugerli, Christian Parisod, Stéphane Joost","doi":"10.1111/eva.13737","DOIUrl":null,"url":null,"abstract":"<p>Landscape genomic analyses associating genetic variation with environmental variables are powerful tools for studying molecular signatures of species' local adaptation and for detecting candidate genes under selection. The development of landscape genomics over the past decade has been spurred by improvements in resolutions of genomic and environmental datasets, allegedly increasing the power to identify putative genes underlying local adaptation in non-model organisms. Although these associations have been successfully applied to numerous species across a diverse array of taxa, the spatial scale of environmental predictor variables has been largely overlooked, potentially limiting conclusions to be reached with these methods. To address this knowledge gap, we systematically evaluated performances of genotype–environment association (GEA) models using predictor variables at multiple spatial resolutions. Specifically, we used multivariate redundancy analyses to associate whole-genome sequence data from the plant <i>Arabis alpina</i> L. collected across four neighboring valleys in the western Swiss Alps, with very high-resolution topographic variables derived from digital elevation models of grain sizes between 0.5 m and 16 m. These comparisons highlight the sensitivity of landscape genomic models to spatial resolution, where the optimal grain sizes were specific to variable type, terrain characteristics, and study extent. To assist in selecting variables at appropriate spatial resolutions, we demonstrate a practical approach to produce, select, and integrate multiscale variables into GEA models. After generalizing fine-grained variables to multiple spatial resolutions, a forward selection procedure is applied to retain only the most relevant variables for a particular context. Depending on the spatial resolution, the relevance for topographic variables in GEA studies calls for integrating multiple spatial scales into landscape genomic models. By carefully considering spatial resolutions, candidate genes under selection by a more realistic range of pressures can be detected for downstream analyses, with important applied implications for experimental research and conservation management of natural populations.</p>","PeriodicalId":168,"journal":{"name":"Evolutionary Applications","volume":null,"pages":null},"PeriodicalIF":3.5000,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11212006/pdf/","citationCount":"0","resultStr":"{\"title\":\"Integrating very high resolution environmental proxies in genotype–environment association studies\",\"authors\":\"Annie S. Guillaume, Kevin Leempoel, Aude Rogivue, Felix Gugerli, Christian Parisod, Stéphane Joost\",\"doi\":\"10.1111/eva.13737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Landscape genomic analyses associating genetic variation with environmental variables are powerful tools for studying molecular signatures of species' local adaptation and for detecting candidate genes under selection. The development of landscape genomics over the past decade has been spurred by improvements in resolutions of genomic and environmental datasets, allegedly increasing the power to identify putative genes underlying local adaptation in non-model organisms. Although these associations have been successfully applied to numerous species across a diverse array of taxa, the spatial scale of environmental predictor variables has been largely overlooked, potentially limiting conclusions to be reached with these methods. To address this knowledge gap, we systematically evaluated performances of genotype–environment association (GEA) models using predictor variables at multiple spatial resolutions. Specifically, we used multivariate redundancy analyses to associate whole-genome sequence data from the plant <i>Arabis alpina</i> L. collected across four neighboring valleys in the western Swiss Alps, with very high-resolution topographic variables derived from digital elevation models of grain sizes between 0.5 m and 16 m. These comparisons highlight the sensitivity of landscape genomic models to spatial resolution, where the optimal grain sizes were specific to variable type, terrain characteristics, and study extent. To assist in selecting variables at appropriate spatial resolutions, we demonstrate a practical approach to produce, select, and integrate multiscale variables into GEA models. After generalizing fine-grained variables to multiple spatial resolutions, a forward selection procedure is applied to retain only the most relevant variables for a particular context. Depending on the spatial resolution, the relevance for topographic variables in GEA studies calls for integrating multiple spatial scales into landscape genomic models. By carefully considering spatial resolutions, candidate genes under selection by a more realistic range of pressures can be detected for downstream analyses, with important applied implications for experimental research and conservation management of natural populations.</p>\",\"PeriodicalId\":168,\"journal\":{\"name\":\"Evolutionary Applications\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11212006/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Evolutionary Applications\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/eva.13737\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EVOLUTIONARY BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Evolutionary Applications","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/eva.13737","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}
Integrating very high resolution environmental proxies in genotype–environment association studies
Landscape genomic analyses associating genetic variation with environmental variables are powerful tools for studying molecular signatures of species' local adaptation and for detecting candidate genes under selection. The development of landscape genomics over the past decade has been spurred by improvements in resolutions of genomic and environmental datasets, allegedly increasing the power to identify putative genes underlying local adaptation in non-model organisms. Although these associations have been successfully applied to numerous species across a diverse array of taxa, the spatial scale of environmental predictor variables has been largely overlooked, potentially limiting conclusions to be reached with these methods. To address this knowledge gap, we systematically evaluated performances of genotype–environment association (GEA) models using predictor variables at multiple spatial resolutions. Specifically, we used multivariate redundancy analyses to associate whole-genome sequence data from the plant Arabis alpina L. collected across four neighboring valleys in the western Swiss Alps, with very high-resolution topographic variables derived from digital elevation models of grain sizes between 0.5 m and 16 m. These comparisons highlight the sensitivity of landscape genomic models to spatial resolution, where the optimal grain sizes were specific to variable type, terrain characteristics, and study extent. To assist in selecting variables at appropriate spatial resolutions, we demonstrate a practical approach to produce, select, and integrate multiscale variables into GEA models. After generalizing fine-grained variables to multiple spatial resolutions, a forward selection procedure is applied to retain only the most relevant variables for a particular context. Depending on the spatial resolution, the relevance for topographic variables in GEA studies calls for integrating multiple spatial scales into landscape genomic models. By carefully considering spatial resolutions, candidate genes under selection by a more realistic range of pressures can be detected for downstream analyses, with important applied implications for experimental research and conservation management of natural populations.
期刊介绍:
Evolutionary Applications is a fully peer reviewed open access journal. It publishes papers that utilize concepts from evolutionary biology to address biological questions of health, social and economic relevance. Papers are expected to employ evolutionary concepts or methods to make contributions to areas such as (but not limited to): medicine, agriculture, forestry, exploitation and management (fisheries and wildlife), aquaculture, conservation biology, environmental sciences (including climate change and invasion biology), microbiology, and toxicology. All taxonomic groups are covered from microbes, fungi, plants and animals. In order to better serve the community, we also now strongly encourage submissions of papers making use of modern molecular and genetic methods (population and functional genomics, transcriptomics, proteomics, epigenetics, quantitative genetics, association and linkage mapping) to address important questions in any of these disciplines and in an applied evolutionary framework. Theoretical, empirical, synthesis or perspective papers are welcome.