Yaoxi He, Xiaoming Zhang, Min-Sheng Peng, Yu-Chun Li, Kai Liu, Yu Zhang, Leyan Mao, Yongbo Guo, Yujie Ma, Bin Zhou, Wangshan Zheng, Tian Yue, Yuwen Liao, Shen-Ao Liang, Lu Chen, Weijie Zhang, Xiaoning Chen, Bixia Tang, Xiaofei Yang, Kai Ye, Shenghan Gao, Yurun Lu, Yong Wang, Shijie Wan, Rushan Hao, Xuankai Wang, Yafei Mao, Shanshan Dai, Zongliang Gao, Li-Qin Yang, Jianxin Guo, Jiangguo Li, Chao Liu, Jianhua Wang, Tuot Sovannary, Long Bunnath, Jatupol Kampuansai, Angkhana Inta, Metawee Srikummool, Wibhu Kutanan, Huy Quang Ho, Khoa Dang Pham, Sommay Singthong, Somphad Sochampa, U. Win Kyaing, Wittaya Pongamornkul, Chutima Morlaeku, Kittisak Rattanakrajangsri, Consortium of Anthropological Research in Southeast Asia and Southwest China (CASEAC), Qing-Peng Kong, Ya-Ping Zhang, Bing Su
{"title":"Genome diversity and signatures of natural selection in mainland Southeast Asia","authors":"Yaoxi He, Xiaoming Zhang, Min-Sheng Peng, Yu-Chun Li, Kai Liu, Yu Zhang, Leyan Mao, Yongbo Guo, Yujie Ma, Bin Zhou, Wangshan Zheng, Tian Yue, Yuwen Liao, Shen-Ao Liang, Lu Chen, Weijie Zhang, Xiaoning Chen, Bixia Tang, Xiaofei Yang, Kai Ye, Shenghan Gao, Yurun Lu, Yong Wang, Shijie Wan, Rushan Hao, Xuankai Wang, Yafei Mao, Shanshan Dai, Zongliang Gao, Li-Qin Yang, Jianxin Guo, Jiangguo Li, Chao Liu, Jianhua Wang, Tuot Sovannary, Long Bunnath, Jatupol Kampuansai, Angkhana Inta, Metawee Srikummool, Wibhu Kutanan, Huy Quang Ho, Khoa Dang Pham, Sommay Singthong, Somphad Sochampa, U. Win Kyaing, Wittaya Pongamornkul, Chutima Morlaeku, Kittisak Rattanakrajangsri, Consortium of Anthropological Research in Southeast Asia and Southwest China (CASEAC), Qing-Peng Kong, Ya-Ping Zhang, Bing Su","doi":"10.1038/s41586-025-08998-w","DOIUrl":null,"url":null,"abstract":"Mainland Southeast Asia (MSEA) has rich ethnic and cultural diversity with a population of nearly 300 million1,2. However, people from MSEA are underrepresented in the current human genomic databases. Here we present the SEA3K genome dataset (phase I), generated by deep short-read whole-genome sequencing of 3,023 individuals from 30 MSEA populations, and long-read whole-genome sequencing of 37 representative individuals. We identified 79.59 million small variants and 96,384 structural variants, among which 22.83 million small variants and 24,622 structural variants are unique to this dataset. We observed a high genetic heterogeneity across MSEA populations, reflected by the varied combinations of genetic components. We identified 44 genomic regions with strong signatures of Darwinian positive selection, covering 89 genes involved in varied physiological systems such as physical traits and immune response. Furthermore, we observed varied patterns of archaic Denisovan introgression in MSEA populations, supporting the proposal of at least two distinct instances of Denisovan admixture into modern humans in Asia3. We also detected genomic regions that suggest adaptive archaic introgressions in MSEA populations. The large number of novel genomic variants in MSEA populations highlight the necessity of studying regional populations that can help answer key questions related to prehistory, genetic adaptation and complex diseases. Analysis of whole-genome sequencing data from 3,023 human genomes from mainland Southeast Asia identifies high genetic heterogeneity among populations in the region and many unique small and structural variants.","PeriodicalId":18787,"journal":{"name":"Nature","volume":"643 8071","pages":"417-426"},"PeriodicalIF":50.5000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature","FirstCategoryId":"103","ListUrlMain":"https://www.nature.com/articles/s41586-025-08998-w","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Mainland Southeast Asia (MSEA) has rich ethnic and cultural diversity with a population of nearly 300 million1,2. However, people from MSEA are underrepresented in the current human genomic databases. Here we present the SEA3K genome dataset (phase I), generated by deep short-read whole-genome sequencing of 3,023 individuals from 30 MSEA populations, and long-read whole-genome sequencing of 37 representative individuals. We identified 79.59 million small variants and 96,384 structural variants, among which 22.83 million small variants and 24,622 structural variants are unique to this dataset. We observed a high genetic heterogeneity across MSEA populations, reflected by the varied combinations of genetic components. We identified 44 genomic regions with strong signatures of Darwinian positive selection, covering 89 genes involved in varied physiological systems such as physical traits and immune response. Furthermore, we observed varied patterns of archaic Denisovan introgression in MSEA populations, supporting the proposal of at least two distinct instances of Denisovan admixture into modern humans in Asia3. We also detected genomic regions that suggest adaptive archaic introgressions in MSEA populations. The large number of novel genomic variants in MSEA populations highlight the necessity of studying regional populations that can help answer key questions related to prehistory, genetic adaptation and complex diseases. Analysis of whole-genome sequencing data from 3,023 human genomes from mainland Southeast Asia identifies high genetic heterogeneity among populations in the region and many unique small and structural variants.
期刊介绍:
Nature is a prestigious international journal that publishes peer-reviewed research in various scientific and technological fields. The selection of articles is based on criteria such as originality, importance, interdisciplinary relevance, timeliness, accessibility, elegance, and surprising conclusions. In addition to showcasing significant scientific advances, Nature delivers rapid, authoritative, insightful news, and interpretation of current and upcoming trends impacting science, scientists, and the broader public. The journal serves a dual purpose: firstly, to promptly share noteworthy scientific advances and foster discussions among scientists, and secondly, to ensure the swift dissemination of scientific results globally, emphasizing their significance for knowledge, culture, and daily life.