Caitlin E. Carey, Rebecca Shafee, Robbee Wedow, Amanda Elliott, Duncan S. Palmer, John Compitello, Masahiro Kanai, Liam Abbott, Patrick Schultz, Konrad J. Karczewski, Samuel C. Bryant, Caroline M. Cusick, Claire Churchhouse, Daniel P. Howrigan, Daniel King, George Davey Smith, Benjamin M. Neale, Raymond K. Walters, Elise B. Robinson
{"title":"英国生物库表型数据的原理提炼揭示了人类变异的潜在结构","authors":"Caitlin E. Carey, Rebecca Shafee, Robbee Wedow, Amanda Elliott, Duncan S. Palmer, John Compitello, Masahiro Kanai, Liam Abbott, Patrick Schultz, Konrad J. Karczewski, Samuel C. Bryant, Caroline M. Cusick, Claire Churchhouse, Daniel P. Howrigan, Daniel King, George Davey Smith, Benjamin M. Neale, Raymond K. Walters, Elise B. Robinson","doi":"10.1038/s41562-024-01909-5","DOIUrl":null,"url":null,"abstract":"Data within biobanks capture broad yet detailed indices of human variation, but biobank-wide insights can be difficult to extract due to complexity and scale. Here, using large-scale factor analysis, we distill hundreds of variables (diagnoses, assessments and survey items) into 35 latent constructs, using data from unrelated individuals with predominantly estimated European genetic ancestry in UK Biobank. These factors recapitulate known disease classifications, disentangle elements of socioeconomic status, highlight the relevance of psychiatric constructs to health and improve measurement of pro-health behaviours. We go on to demonstrate the power of this approach to clarify genetic signal, enhance discovery and identify associations between underlying phenotypic structure and health outcomes. In building a deeper understanding of ways in which constructs such as socioeconomic status, trauma, or physical activity are structured in the dataset, we emphasize the importance of considering the interwoven nature of the human phenome when evaluating public health patterns. Carey and colleagues reveal 35 major latent constructs (factors) in the phenotype data of unrelated individuals with predominantly estimated European genetic ancestry from UK Biobank.","PeriodicalId":19074,"journal":{"name":"Nature Human Behaviour","volume":"8 8","pages":"1599-1615"},"PeriodicalIF":21.4000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41562-024-01909-5.pdf","citationCount":"0","resultStr":"{\"title\":\"Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation\",\"authors\":\"Caitlin E. Carey, Rebecca Shafee, Robbee Wedow, Amanda Elliott, Duncan S. Palmer, John Compitello, Masahiro Kanai, Liam Abbott, Patrick Schultz, Konrad J. Karczewski, Samuel C. Bryant, Caroline M. Cusick, Claire Churchhouse, Daniel P. Howrigan, Daniel King, George Davey Smith, Benjamin M. Neale, Raymond K. Walters, Elise B. Robinson\",\"doi\":\"10.1038/s41562-024-01909-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data within biobanks capture broad yet detailed indices of human variation, but biobank-wide insights can be difficult to extract due to complexity and scale. Here, using large-scale factor analysis, we distill hundreds of variables (diagnoses, assessments and survey items) into 35 latent constructs, using data from unrelated individuals with predominantly estimated European genetic ancestry in UK Biobank. These factors recapitulate known disease classifications, disentangle elements of socioeconomic status, highlight the relevance of psychiatric constructs to health and improve measurement of pro-health behaviours. We go on to demonstrate the power of this approach to clarify genetic signal, enhance discovery and identify associations between underlying phenotypic structure and health outcomes. In building a deeper understanding of ways in which constructs such as socioeconomic status, trauma, or physical activity are structured in the dataset, we emphasize the importance of considering the interwoven nature of the human phenome when evaluating public health patterns. Carey and colleagues reveal 35 major latent constructs (factors) in the phenotype data of unrelated individuals with predominantly estimated European genetic ancestry from UK Biobank.\",\"PeriodicalId\":19074,\"journal\":{\"name\":\"Nature Human Behaviour\",\"volume\":\"8 8\",\"pages\":\"1599-1615\"},\"PeriodicalIF\":21.4000,\"publicationDate\":\"2024-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.nature.com/articles/s41562-024-01909-5.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Human Behaviour\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.nature.com/articles/s41562-024-01909-5\",\"RegionNum\":1,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Human Behaviour","FirstCategoryId":"102","ListUrlMain":"https://www.nature.com/articles/s41562-024-01909-5","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation
Data within biobanks capture broad yet detailed indices of human variation, but biobank-wide insights can be difficult to extract due to complexity and scale. Here, using large-scale factor analysis, we distill hundreds of variables (diagnoses, assessments and survey items) into 35 latent constructs, using data from unrelated individuals with predominantly estimated European genetic ancestry in UK Biobank. These factors recapitulate known disease classifications, disentangle elements of socioeconomic status, highlight the relevance of psychiatric constructs to health and improve measurement of pro-health behaviours. We go on to demonstrate the power of this approach to clarify genetic signal, enhance discovery and identify associations between underlying phenotypic structure and health outcomes. In building a deeper understanding of ways in which constructs such as socioeconomic status, trauma, or physical activity are structured in the dataset, we emphasize the importance of considering the interwoven nature of the human phenome when evaluating public health patterns. Carey and colleagues reveal 35 major latent constructs (factors) in the phenotype data of unrelated individuals with predominantly estimated European genetic ancestry from UK Biobank.
期刊介绍:
Nature Human Behaviour is a journal that focuses on publishing research of outstanding significance into any aspect of human behavior.The research can cover various areas such as psychological, biological, and social bases of human behavior.It also includes the study of origins, development, and disorders related to human behavior.The primary aim of the journal is to increase the visibility of research in the field and enhance its societal reach and impact.