Structure-based network analysis predicts pathogenic variants in human proteins associated with inherited retinal disease.

IF 4.7 2区 医学 Q1 GENETICS & HEREDITY
Blake M Hauser, Yuyang Luo, Anusha Nathan, Ahmad Al-Moujahed, Demetrios G Vavvas, Jason Comander, Eric A Pierce, Emily M Place, Kinga M Bujakowska, Gaurav D Gaiha, Elizabeth J Rossin
{"title":"Structure-based network analysis predicts pathogenic variants in human proteins associated with inherited retinal disease.","authors":"Blake M Hauser, Yuyang Luo, Anusha Nathan, Ahmad Al-Moujahed, Demetrios G Vavvas, Jason Comander, Eric A Pierce, Emily M Place, Kinga M Bujakowska, Gaurav D Gaiha, Elizabeth J Rossin","doi":"10.1038/s41525-024-00416-w","DOIUrl":null,"url":null,"abstract":"<p><p>Advances in gene sequencing technologies have accelerated the identification of genetic variants, but better tools are needed to understand which are causal of disease. This would be particularly useful in fields where gene therapy is a potential therapeutic modality for a disease-causing variant such as inherited retinal disease (IRD). Here, we apply structure-based network analysis (SBNA), which has been successfully utilized to identify variant-constrained amino acid residues in viral proteins, to identify residues that may cause IRD if subject to missense mutation. SBNA is based entirely on structural first principles and is not fit to specific outcome data, which makes it distinct from other contemporary missense prediction tools. In 4 well-studied human disease-associated proteins (BRCA1, HRAS, PTEN, and ERK2) with high-quality structural data, we find that SBNA scores correlate strongly with deep mutagenesis data. When applied to 47 IRD genes with available high-quality crystal structure data, SBNA scores reliably identified disease-causing variants according to phenotype definitions from the ClinVar database. Finally, we applied this approach to 63 patients at Massachusetts Eye and Ear (MEE) with IRD but for whom no genetic cause had been identified. Untrained models built using SBNA scores and BLOSUM62 scores for IRD-associated genes successfully predicted the pathogenicity of novel variants (AUC = 0.851), allowing us to identify likely causative disease variants in 40 IRD patients. Model performance was further augmented by incorporating orthogonal data from EVE scores (AUC = 0.927), which are based on evolutionary multiple sequence alignments. In conclusion, SBNA can used to successfully identify variants as causal of disease in human proteins and may help predict variants causative of IRD in an unbiased fashion.</p>","PeriodicalId":19273,"journal":{"name":"NPJ Genomic Medicine","volume":"9 1","pages":"31"},"PeriodicalIF":4.7000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11130145/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Genomic Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41525-024-00416-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Advances in gene sequencing technologies have accelerated the identification of genetic variants, but better tools are needed to understand which are causal of disease. This would be particularly useful in fields where gene therapy is a potential therapeutic modality for a disease-causing variant such as inherited retinal disease (IRD). Here, we apply structure-based network analysis (SBNA), which has been successfully utilized to identify variant-constrained amino acid residues in viral proteins, to identify residues that may cause IRD if subject to missense mutation. SBNA is based entirely on structural first principles and is not fit to specific outcome data, which makes it distinct from other contemporary missense prediction tools. In 4 well-studied human disease-associated proteins (BRCA1, HRAS, PTEN, and ERK2) with high-quality structural data, we find that SBNA scores correlate strongly with deep mutagenesis data. When applied to 47 IRD genes with available high-quality crystal structure data, SBNA scores reliably identified disease-causing variants according to phenotype definitions from the ClinVar database. Finally, we applied this approach to 63 patients at Massachusetts Eye and Ear (MEE) with IRD but for whom no genetic cause had been identified. Untrained models built using SBNA scores and BLOSUM62 scores for IRD-associated genes successfully predicted the pathogenicity of novel variants (AUC = 0.851), allowing us to identify likely causative disease variants in 40 IRD patients. Model performance was further augmented by incorporating orthogonal data from EVE scores (AUC = 0.927), which are based on evolutionary multiple sequence alignments. In conclusion, SBNA can used to successfully identify variants as causal of disease in human proteins and may help predict variants causative of IRD in an unbiased fashion.

Abstract Image

基于结构的网络分析预测了与遗传性视网膜疾病相关的人类蛋白质中的致病变体。
基因测序技术的进步加快了基因变异的鉴定速度,但还需要更好的工具来了解哪些基因变异是致病因素。这在基因治疗是一种潜在的致病变异治疗方法的领域尤其有用,如遗传性视网膜病(IRD)。在这里,我们应用基于结构的网络分析(SBNA)来识别病毒蛋白质中受变异约束的氨基酸残基,该方法已成功用于识别发生错义突变时可能导致 IRD 的残基。SBNA 完全基于结构第一性原理,并不与特定结果数据相匹配,这使其有别于其他当代的错义预测工具。在具有高质量结构数据的 4 个研究充分的人类疾病相关蛋白(BRCA1、HRAS、PTEN 和 ERK2)中,我们发现 SBNA 分数与深度诱变数据密切相关。当应用于 47 个具有高质量晶体结构数据的 IRD 基因时,根据 ClinVar 数据库中的表型定义,SBNA 评分可靠地鉴定出了致病变体。最后,我们将这种方法应用于马萨诸塞眼耳科(MEE)的 63 名 IRD 患者,这些患者的遗传病因尚未确定。使用 IRD 相关基因的 SBNA 评分和 BLOSUM62 评分建立的未训练模型成功预测了新型变异体的致病性(AUC = 0.851),使我们能够确定 40 例 IRD 患者中可能的致病变异体。基于进化多序列比对的 EVE 评分(AUC = 0.927)纳入了正交数据,进一步提高了模型性能。总之,SBNA 可用于成功识别人类蛋白质中的致病变异,并有助于以无偏见的方式预测 IRD 的致病变异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
NPJ Genomic Medicine
NPJ Genomic Medicine Biochemistry, Genetics and Molecular Biology-Molecular Biology
CiteScore
9.40
自引率
1.90%
发文量
67
审稿时长
17 weeks
期刊介绍: npj Genomic Medicine is an international, peer-reviewed journal dedicated to publishing the most important scientific advances in all aspects of genomics and its application in the practice of medicine. The journal defines genomic medicine as "diagnosis, prognosis, prevention and/or treatment of disease and disorders of the mind and body, using approaches informed or enabled by knowledge of the genome and the molecules it encodes." Relevant and high-impact papers that encompass studies of individuals, families, or populations are considered for publication. An emphasis will include coupling detailed phenotype and genome sequencing information, both enabled by new technologies and informatics, to delineate the underlying aetiology of disease. Clinical recommendations and/or guidelines of how that data should be used in the clinical management of those patients in the study, and others, are also encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信