与人类健康和牲畜抗病相关的遗传数据的空间依赖性:地理学在支持“同一个健康”方法中的作用

International Journal of Health, Animal science and Food safety Pub Date : 2017-06-05 DOI:10.13130/2283-3927/8529

S. Joost

{"title":"与人类健康和牲畜抗病相关的遗传数据的空间依赖性:地理学在支持“同一个健康”方法中的作用","authors":"S. Joost","doi":"10.13130/2283-3927/8529","DOIUrl":null,"url":null,"abstract":"The spatial dependence of located health and/or genetic data can be used to detect clusters likely to reveal disease prevalence or signatures of adaptation possibly associated with characteristics of the local environment (high temperatures, air or water pollution), be it in humans or animals (Murtaugh et al . 2017). Most often, geographic maps are produced to represent health data. Medical information is transmitted through thematic choropleth maps. For instance administrative units are colored according to the variable of interest. But it is key to analyse health and/or genetic data by explicitly including geographic characteristics (distances, co-location) and also the potential and power of spatial statistics to detect specific patterns in the geographic distribution of disease occurrences (“make visible the invisible”). A classic example using clusters is the map produced by John Snow (Snow 1855) showing the number of deaths caused by a cholera outbreak in London. Looking at a detail of Snow's original map, it is possible to realize how he graphically represented the number of deaths, with short bold lines representing death occurrences (frequencies forming a kind of histogram) placed on the street at the addresses where it happened - what we currently name georeferencing. A cluster of death people is an effect observed on the territory, and the existence of such a cluster depends on an infected water pump located at the same place (the cause). How can this spatial dependence be detected and measured? It is possible to identify spatial patterns in the geographic space by means of spatial statistics. We need to determine whether the variable of interest is randomly distributed or spatially dependent, and to check if the patterns observed are robust to random permutations. We also need to explore the data, to find out what is the range of influence of this spatial dependence. Here we focus on the functioning of one among several measures of spatial autocorrelation named Moran’s (Moran 1950). Moran’s I translates the global relationship between the behavior of points and of their neighborhood. Measures of spatial dependence are key to detect and visualize spatial patterns in health and/or genetic data because spatial statistics can reveal signals that remain often hidden using thematic mapping. On the basis of the clusters highlighted by these exploratory methods, it is possible to formulate hypotheses about possible environmental or socio-economic causes and to test them with the help of confirmatory statistics. «Ideas come from previous explorations» John Tukey said in a paper published in 1980 in The American Statistician, in a paper entitled «We Need Both Exploratory and Confirmatory» (Tukey 1980). First explore and then confirm was already the reasoning applied by John Snow to detect death \"hot spots\" in London, which then allowed him to hypothesize that a particular water pump was infected, and finally to take public health steps to check the cholera epidemic. Two examples illustrate the use of these spatial statistics: first, a cohort named COLAUS and established in the city of Lausanne was used to replicate the results obtained with 120’000 adults from the UK Biobank study to test the hypothesis that high-risk obesogenic environments and behaviors accentuate genetic susceptibility to obesity (Tyrell et al . 2017). Our findings suggest that the obesogenic environment accentuates the risk of obesity in genetically susceptible adults. Of the factors we tested, relative social deprivation (Townsend Deprivation Index) best captures the aspects of the obesogenic environment responsible. We produced a map of Lausanne showing the results of bivariate Local Indicators of Spatial Association (LISA) involving: 1) the value of the genetic risk score (GRS) based on 69 genetic variants and associated with obesity as identified by the GIANT consortium (more than 330’000 individuals); 2) the Townsend Deprivation Index (TDI), a composite measure of deprivation based on unemployment, non-car ownership, non-home ownership and household overcrowding. The analysis permits to identify clusters where a high GRS depends on a high mean of the TDI calculated within a spatial lag of 800m. Compared with a previous analysis applied to BMI in Lausanne, we were able to delimit areas where genetic susceptibility and deprivation result in observed obesity. The second example is an application of landscape genomics (Joost et al . 2007) to goat breeds in Europe and to cattle in Uganda to show how measures of spatial autocorrelation can be used to identify similarities or differences in genotype occurrences between neighboring individuals that cannot be explained by chance (Stucki et al . 2016). In Uganda, LISA indicators applied to genomic data in the Ankole cattle breed reveal a pattern corresponding to the known geographic distribution of Trypanosoma brucei gambiense .","PeriodicalId":14105,"journal":{"name":"International Journal of Health, Animal science and Food safety","volume":"51 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Spatial dependence of genetic data related to human health and livestock disease resistance: a role for geography to support the One Health approach\",\"authors\":\"S. Joost\",\"doi\":\"10.13130/2283-3927/8529\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The spatial dependence of located health and/or genetic data can be used to detect clusters likely to reveal disease prevalence or signatures of adaptation possibly associated with characteristics of the local environment (high temperatures, air or water pollution), be it in humans or animals (Murtaugh et al . 2017). Most often, geographic maps are produced to represent health data. Medical information is transmitted through thematic choropleth maps. For instance administrative units are colored according to the variable of interest. But it is key to analyse health and/or genetic data by explicitly including geographic characteristics (distances, co-location) and also the potential and power of spatial statistics to detect specific patterns in the geographic distribution of disease occurrences (“make visible the invisible”). A classic example using clusters is the map produced by John Snow (Snow 1855) showing the number of deaths caused by a cholera outbreak in London. Looking at a detail of Snow's original map, it is possible to realize how he graphically represented the number of deaths, with short bold lines representing death occurrences (frequencies forming a kind of histogram) placed on the street at the addresses where it happened - what we currently name georeferencing. A cluster of death people is an effect observed on the territory, and the existence of such a cluster depends on an infected water pump located at the same place (the cause). How can this spatial dependence be detected and measured? It is possible to identify spatial patterns in the geographic space by means of spatial statistics. We need to determine whether the variable of interest is randomly distributed or spatially dependent, and to check if the patterns observed are robust to random permutations. We also need to explore the data, to find out what is the range of influence of this spatial dependence. Here we focus on the functioning of one among several measures of spatial autocorrelation named Moran’s (Moran 1950). Moran’s I translates the global relationship between the behavior of points and of their neighborhood. Measures of spatial dependence are key to detect and visualize spatial patterns in health and/or genetic data because spatial statistics can reveal signals that remain often hidden using thematic mapping. On the basis of the clusters highlighted by these exploratory methods, it is possible to formulate hypotheses about possible environmental or socio-economic causes and to test them with the help of confirmatory statistics. «Ideas come from previous explorations» John Tukey said in a paper published in 1980 in The American Statistician, in a paper entitled «We Need Both Exploratory and Confirmatory» (Tukey 1980). First explore and then confirm was already the reasoning applied by John Snow to detect death \\\"hot spots\\\" in London, which then allowed him to hypothesize that a particular water pump was infected, and finally to take public health steps to check the cholera epidemic. Two examples illustrate the use of these spatial statistics: first, a cohort named COLAUS and established in the city of Lausanne was used to replicate the results obtained with 120’000 adults from the UK Biobank study to test the hypothesis that high-risk obesogenic environments and behaviors accentuate genetic susceptibility to obesity (Tyrell et al . 2017). Our findings suggest that the obesogenic environment accentuates the risk of obesity in genetically susceptible adults. Of the factors we tested, relative social deprivation (Townsend Deprivation Index) best captures the aspects of the obesogenic environment responsible. We produced a map of Lausanne showing the results of bivariate Local Indicators of Spatial Association (LISA) involving: 1) the value of the genetic risk score (GRS) based on 69 genetic variants and associated with obesity as identified by the GIANT consortium (more than 330’000 individuals); 2) the Townsend Deprivation Index (TDI), a composite measure of deprivation based on unemployment, non-car ownership, non-home ownership and household overcrowding. The analysis permits to identify clusters where a high GRS depends on a high mean of the TDI calculated within a spatial lag of 800m. Compared with a previous analysis applied to BMI in Lausanne, we were able to delimit areas where genetic susceptibility and deprivation result in observed obesity. The second example is an application of landscape genomics (Joost et al . 2007) to goat breeds in Europe and to cattle in Uganda to show how measures of spatial autocorrelation can be used to identify similarities or differences in genotype occurrences between neighboring individuals that cannot be explained by chance (Stucki et al . 2016). In Uganda, LISA indicators applied to genomic data in the Ankole cattle breed reveal a pattern corresponding to the known geographic distribution of Trypanosoma brucei gambiense .\",\"PeriodicalId\":14105,\"journal\":{\"name\":\"International Journal of Health, Animal science and Food safety\",\"volume\":\"51 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Health, Animal science and Food safety\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.13130/2283-3927/8529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Health, Animal science and Food safety","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.13130/2283-3927/8529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

所定位的健康和/或遗传数据的空间依赖性可用于检测可能揭示疾病流行或可能与当地环境特征(高温、空气或水污染)相关的适应特征的群集，无论是人类还是动物(Murtaugh等)。2017)。大多数情况下，绘制地理地图是为了表示卫生数据。医疗信息通过专题地形图传播。例如，行政单位是根据感兴趣的变量着色的。但是，分析健康和(或)遗传数据的关键是明确包括地理特征(距离、同处一地)，以及空间统计在发现疾病发生的地理分布的特定模式方面的潜力和力量("使不可见的显现出来")。John Snow制作的地图(Snow 1855)就是一个典型的使用集群的例子，它显示了伦敦霍乱爆发造成的死亡人数。仔细观察斯诺原始地图的细节，我们就能意识到他是如何用图形表示死亡人数的，用短的粗体线表示死亡事件发生的地点(频率形成一种直方图)——我们现在称之为地理参考。死亡人群聚集是在领土上观察到的一种影响，这种聚集的存在取决于位于同一地方的受感染水泵(原因)。如何检测和测量这种空间依赖性?利用空间统计方法可以识别地理空间中的空间格局。我们需要确定感兴趣的变量是随机分布的还是空间依赖的，并检查观察到的模式是否对随机排列具有鲁棒性。我们还需要探索数据，找出这种空间依赖性的影响范围是什么。在这里，我们将重点关注Moran 's (Moran 1950)空间自相关的几种测量方法之一的功能。莫兰的I解释了点的行为和它们的邻居之间的全局关系。空间依赖性措施是发现和可视化健康和/或遗传数据中的空间格局的关键，因为空间统计可以揭示利用专题制图往往隐藏的信号。在这些探索性方法强调的集群的基础上，可以对可能的环境或社会经济原因提出假设，并在确认统计数据的帮助下对其进行检验。John Tukey在1980年发表于《美国统计学家》的一篇题为《我们需要探索性和验证性》(Tukey 1980)的论文中说:“想法来自于以前的探索。”先探索，然后确认，这已经是约翰·斯诺用来探测伦敦死亡“热点”的推理，然后让他假设一个特定的水泵被感染了，最后采取公共卫生措施来控制霍乱的流行。两个例子说明了这些空间统计的使用:首先，在洛桑市建立了一个名为COLAUS的队列，用于复制来自英国生物银行研究的12万成年人的结果，以检验高风险致胖环境和行为加剧肥胖遗传易感性的假设(Tyrell等人)。2017)。我们的研究结果表明，致肥环境增加了遗传易感成年人肥胖的风险。在我们测试的因素中，相对社会剥夺(汤森剥夺指数)最能反映导致肥胖的环境因素。我们制作了一幅洛桑的地图，显示了二元空间关联局部指标(LISA)的结果，包括:1)遗传风险评分(GRS)的价值，GRS基于69个遗传变异，并与GIANT联盟(超过33万人)确定的肥胖相关;2)汤森剥夺指数(TDI)，这是一种基于失业、无车、无房和家庭拥挤程度的综合剥夺指标。该分析允许识别高GRS依赖于在800米空间滞后内计算的高TDI平均值的集群。与先前应用于洛桑BMI的分析相比，我们能够划定遗传易感性和剥夺导致观察到的肥胖的区域。第二个例子是景观基因组学的应用(Joost等人)。2007年)对欧洲的山羊品种和乌干达的牛进行了研究，以表明如何利用空间自相关的措施来确定相邻个体之间基因型发生的相似性或差异性，而这些相似性和差异性无法用偶然因素来解释(Stucki等人)。2016)。在乌干达，应用于Ankole牛品种基因组数据的LISA指标揭示了一种与已知的布氏冈比亚锥虫地理分布相对应的模式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Spatial dependence of genetic data related to human health and livestock disease resistance: a role for geography to support the One Health approach

The spatial dependence of located health and/or genetic data can be used to detect clusters likely to reveal disease prevalence or signatures of adaptation possibly associated with characteristics of the local environment (high temperatures, air or water pollution), be it in humans or animals (Murtaugh et al . 2017). Most often, geographic maps are produced to represent health data. Medical information is transmitted through thematic choropleth maps. For instance administrative units are colored according to the variable of interest. But it is key to analyse health and/or genetic data by explicitly including geographic characteristics (distances, co-location) and also the potential and power of spatial statistics to detect specific patterns in the geographic distribution of disease occurrences (“make visible the invisible”). A classic example using clusters is the map produced by John Snow (Snow 1855) showing the number of deaths caused by a cholera outbreak in London. Looking at a detail of Snow's original map, it is possible to realize how he graphically represented the number of deaths, with short bold lines representing death occurrences (frequencies forming a kind of histogram) placed on the street at the addresses where it happened - what we currently name georeferencing. A cluster of death people is an effect observed on the territory, and the existence of such a cluster depends on an infected water pump located at the same place (the cause). How can this spatial dependence be detected and measured? It is possible to identify spatial patterns in the geographic space by means of spatial statistics. We need to determine whether the variable of interest is randomly distributed or spatially dependent, and to check if the patterns observed are robust to random permutations. We also need to explore the data, to find out what is the range of influence of this spatial dependence. Here we focus on the functioning of one among several measures of spatial autocorrelation named Moran’s (Moran 1950). Moran’s I translates the global relationship between the behavior of points and of their neighborhood. Measures of spatial dependence are key to detect and visualize spatial patterns in health and/or genetic data because spatial statistics can reveal signals that remain often hidden using thematic mapping. On the basis of the clusters highlighted by these exploratory methods, it is possible to formulate hypotheses about possible environmental or socio-economic causes and to test them with the help of confirmatory statistics. «Ideas come from previous explorations» John Tukey said in a paper published in 1980 in The American Statistician, in a paper entitled «We Need Both Exploratory and Confirmatory» (Tukey 1980). First explore and then confirm was already the reasoning applied by John Snow to detect death "hot spots" in London, which then allowed him to hypothesize that a particular water pump was infected, and finally to take public health steps to check the cholera epidemic. Two examples illustrate the use of these spatial statistics: first, a cohort named COLAUS and established in the city of Lausanne was used to replicate the results obtained with 120’000 adults from the UK Biobank study to test the hypothesis that high-risk obesogenic environments and behaviors accentuate genetic susceptibility to obesity (Tyrell et al . 2017). Our findings suggest that the obesogenic environment accentuates the risk of obesity in genetically susceptible adults. Of the factors we tested, relative social deprivation (Townsend Deprivation Index) best captures the aspects of the obesogenic environment responsible. We produced a map of Lausanne showing the results of bivariate Local Indicators of Spatial Association (LISA) involving: 1) the value of the genetic risk score (GRS) based on 69 genetic variants and associated with obesity as identified by the GIANT consortium (more than 330’000 individuals); 2) the Townsend Deprivation Index (TDI), a composite measure of deprivation based on unemployment, non-car ownership, non-home ownership and household overcrowding. The analysis permits to identify clusters where a high GRS depends on a high mean of the TDI calculated within a spatial lag of 800m. Compared with a previous analysis applied to BMI in Lausanne, we were able to delimit areas where genetic susceptibility and deprivation result in observed obesity. The second example is an application of landscape genomics (Joost et al . 2007) to goat breeds in Europe and to cattle in Uganda to show how measures of spatial autocorrelation can be used to identify similarities or differences in genotype occurrences between neighboring individuals that cannot be explained by chance (Stucki et al . 2016). In Uganda, LISA indicators applied to genomic data in the Ankole cattle breed reveal a pattern corresponding to the known geographic distribution of Trypanosoma brucei gambiense .

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Health, Animal science and Food safety

自引率

0.00%

发文量