Mohammadmersad Ghorbani, Shabir Moosa, Zenab Siddig, Radi Farhad, Haroon Naeem, William T. Harvey, Francesco Kumara Mastrorosa, Katherine M. Munson, Rozaimi Mohamad Razali, Elbay Aliyev, Ilhame Diboun, Rawan Abouelhassan, Melissa Tauro, Sondoss Hassan, Rebecca Mathew, Muna Al Hashmi, Lisa S. Mathew, Kun Wang, Abdul Rahman Salhab, Fazulur Rehaman Vempalli, Ahmed El Khouly, Qatar Genome Program Research Consortium, Iman Alazwani, Sara Tomei, Khalid A. Fakhro, Alia Satti, Ruba Benini, Arang Rhie, Evan E. Eichler, Younes Mokrab
{"title":"接近完整的中东基因组改进了自合子性,增强了致病和人群特异性变异的发现","authors":"Mohammadmersad Ghorbani, Shabir Moosa, Zenab Siddig, Radi Farhad, Haroon Naeem, William T. Harvey, Francesco Kumara Mastrorosa, Katherine M. Munson, Rozaimi Mohamad Razali, Elbay Aliyev, Ilhame Diboun, Rawan Abouelhassan, Melissa Tauro, Sondoss Hassan, Rebecca Mathew, Muna Al Hashmi, Lisa S. Mathew, Kun Wang, Abdul Rahman Salhab, Fazulur Rehaman Vempalli, Ahmed El Khouly, Qatar Genome Program Research Consortium, Iman Alazwani, Sara Tomei, Khalid A. Fakhro, Alia Satti, Ruba Benini, Arang Rhie, Evan E. Eichler, Younes Mokrab","doi":"10.1038/s41588-025-02173-7","DOIUrl":null,"url":null,"abstract":"Advances in long-read sequencing have enabled routine complete assembly of human genomes, but much remains to be done to represent broader populations and show impact on disease-gene discovery. Here, we report highly accurate, near-complete and phased genomes from six Middle Eastern (ME) family trios (n = 18) with neurodevelopmental conditions, representing ancestries from Sudan, Jordan, Syria, Qatar and Afghanistan. These genomes revealed 42.2 Mb of new sequence (13.8% impacting known genes), 75 new HLA/KIR alleles and strong signals of inbreeding, with ROH covering up to one-third of chromosomes 6 and 12 in one individual. Using assembly-based variant calling, we identified 23 de novo and recessive variants as strong candidates for causing previously unresolved symptoms in the probands. The ME genomes revealed unique variation relative to existing references, showing enhanced mappability and variant calling. These results underscore the value of de novo assembly for disease variant discovery and the need for sampled ME-specific references to better characterize population-relevant variation. Generation and analysis of high-quality, genome assemblies from Middle Eastern trios demonstrate the utility of ancestry-matched data and assembly-based variation analysis.","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":"57 5","pages":"1119-1131"},"PeriodicalIF":31.7000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41588-025-02173-7.pdf","citationCount":"0","resultStr":"{\"title\":\"Near-complete Middle Eastern genomes refine autozygosity and enhance disease-causing and population-specific variant discovery\",\"authors\":\"Mohammadmersad Ghorbani, Shabir Moosa, Zenab Siddig, Radi Farhad, Haroon Naeem, William T. Harvey, Francesco Kumara Mastrorosa, Katherine M. Munson, Rozaimi Mohamad Razali, Elbay Aliyev, Ilhame Diboun, Rawan Abouelhassan, Melissa Tauro, Sondoss Hassan, Rebecca Mathew, Muna Al Hashmi, Lisa S. Mathew, Kun Wang, Abdul Rahman Salhab, Fazulur Rehaman Vempalli, Ahmed El Khouly, Qatar Genome Program Research Consortium, Iman Alazwani, Sara Tomei, Khalid A. Fakhro, Alia Satti, Ruba Benini, Arang Rhie, Evan E. Eichler, Younes Mokrab\",\"doi\":\"10.1038/s41588-025-02173-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Advances in long-read sequencing have enabled routine complete assembly of human genomes, but much remains to be done to represent broader populations and show impact on disease-gene discovery. Here, we report highly accurate, near-complete and phased genomes from six Middle Eastern (ME) family trios (n = 18) with neurodevelopmental conditions, representing ancestries from Sudan, Jordan, Syria, Qatar and Afghanistan. These genomes revealed 42.2 Mb of new sequence (13.8% impacting known genes), 75 new HLA/KIR alleles and strong signals of inbreeding, with ROH covering up to one-third of chromosomes 6 and 12 in one individual. Using assembly-based variant calling, we identified 23 de novo and recessive variants as strong candidates for causing previously unresolved symptoms in the probands. The ME genomes revealed unique variation relative to existing references, showing enhanced mappability and variant calling. These results underscore the value of de novo assembly for disease variant discovery and the need for sampled ME-specific references to better characterize population-relevant variation. Generation and analysis of high-quality, genome assemblies from Middle Eastern trios demonstrate the utility of ancestry-matched data and assembly-based variation analysis.\",\"PeriodicalId\":18985,\"journal\":{\"name\":\"Nature genetics\",\"volume\":\"57 5\",\"pages\":\"1119-1131\"},\"PeriodicalIF\":31.7000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.nature.com/articles/s41588-025-02173-7.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.nature.com/articles/s41588-025-02173-7\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41588-025-02173-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
Near-complete Middle Eastern genomes refine autozygosity and enhance disease-causing and population-specific variant discovery
Advances in long-read sequencing have enabled routine complete assembly of human genomes, but much remains to be done to represent broader populations and show impact on disease-gene discovery. Here, we report highly accurate, near-complete and phased genomes from six Middle Eastern (ME) family trios (n = 18) with neurodevelopmental conditions, representing ancestries from Sudan, Jordan, Syria, Qatar and Afghanistan. These genomes revealed 42.2 Mb of new sequence (13.8% impacting known genes), 75 new HLA/KIR alleles and strong signals of inbreeding, with ROH covering up to one-third of chromosomes 6 and 12 in one individual. Using assembly-based variant calling, we identified 23 de novo and recessive variants as strong candidates for causing previously unresolved symptoms in the probands. The ME genomes revealed unique variation relative to existing references, showing enhanced mappability and variant calling. These results underscore the value of de novo assembly for disease variant discovery and the need for sampled ME-specific references to better characterize population-relevant variation. Generation and analysis of high-quality, genome assemblies from Middle Eastern trios demonstrate the utility of ancestry-matched data and assembly-based variation analysis.
期刊介绍:
Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation.
Integrative genetic topics comprise, but are not limited to:
-Genes in the pathology of human disease
-Molecular analysis of simple and complex genetic traits
-Cancer genetics
-Agricultural genomics
-Developmental genetics
-Regulatory variation in gene expression
-Strategies and technologies for extracting function from genomic data
-Pharmacological genomics
-Genome evolution