Clinical Utility of Protein Language Models in Resolution of Variants of Uncertain Significance in KCNQ1, KCNH2, and SCN5A Compared With Patch-Clamp Functional Characterization.
Dan Ye, Ramin Garmany, Estefania Martinez-Barrios, Xiaozhi Gao, Raquel Almeida Lopes Neves, David J Tester, Sahej Bains, Wei Zhou, John R Giudicessi, Michael J Ackerman
{"title":"Clinical Utility of Protein Language Models in Resolution of Variants of Uncertain Significance in <i>KCNQ1, KCNH2</i>, and <i>SCN5A</i> Compared With Patch-Clamp Functional Characterization.","authors":"Dan Ye, Ramin Garmany, Estefania Martinez-Barrios, Xiaozhi Gao, Raquel Almeida Lopes Neves, David J Tester, Sahej Bains, Wei Zhou, John R Giudicessi, Michael J Ackerman","doi":"10.1161/CIRCGEN.124.004584","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Genetic testing for cardiac channelopathies is the standard of care. However, many rare genetic variants remain classified as variants of uncertain significance (VUS) due to lack of epidemiological and functional data. Whether deep protein language models may aid in VUS resolution remains unknown. Here, we set out to compare how 2 deep protein language models perform at VUS resolution in the 3 most common long-QT syndrome-causative genes compared with the gold-standard patch clamp.</p><p><strong>Methods: </strong>A total of 72 rare nonsynonymous VUS (9 <i>KCNQ1,</i> 19 <i>KCNH2</i>, and 50 <i>SCN5A</i>) were engineered by site-directed mutagenesis and expressed in either HEK293 cells or TSA201 cells. Whole-cell patch-clamp technique was used to functionally characterize these variants. The protein language models, evolutionary scale modeling, version 1b and AlphaMissense, were used to predict the variant effect of missense variants and compared with patch clamp.</p><p><strong>Results: </strong>Considering variants in all 3 genes, the evolutionary scale modeling, version 1b model had a receiver operating characteristic curve-area under the curve of 0.75 (<i>P</i>=0.0003). It had a sensitivity of 88% and a specificity of 50%. AlphaMissense performed well compared with patch-clamp with an receiver operating characteristic curve-area under the curve of 0.85 (<i>P</i><0.0001), sensitivity of 80%, and specificity of 76%.</p><p><strong>Conclusions: </strong>Deep protein language models aid in VUS resolution with high sensitivity but lower specificity. Thus, these tools cannot fully replace functional characterization but can aid in reducing the number of variants that may require functional analysis.</p>","PeriodicalId":10326,"journal":{"name":"Circulation: Genomic and Precision Medicine","volume":" ","pages":"e004584"},"PeriodicalIF":6.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circulation: Genomic and Precision Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1161/CIRCGEN.124.004584","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/9 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Genetic testing for cardiac channelopathies is the standard of care. However, many rare genetic variants remain classified as variants of uncertain significance (VUS) due to lack of epidemiological and functional data. Whether deep protein language models may aid in VUS resolution remains unknown. Here, we set out to compare how 2 deep protein language models perform at VUS resolution in the 3 most common long-QT syndrome-causative genes compared with the gold-standard patch clamp.
Methods: A total of 72 rare nonsynonymous VUS (9 KCNQ1, 19 KCNH2, and 50 SCN5A) were engineered by site-directed mutagenesis and expressed in either HEK293 cells or TSA201 cells. Whole-cell patch-clamp technique was used to functionally characterize these variants. The protein language models, evolutionary scale modeling, version 1b and AlphaMissense, were used to predict the variant effect of missense variants and compared with patch clamp.
Results: Considering variants in all 3 genes, the evolutionary scale modeling, version 1b model had a receiver operating characteristic curve-area under the curve of 0.75 (P=0.0003). It had a sensitivity of 88% and a specificity of 50%. AlphaMissense performed well compared with patch-clamp with an receiver operating characteristic curve-area under the curve of 0.85 (P<0.0001), sensitivity of 80%, and specificity of 76%.
Conclusions: Deep protein language models aid in VUS resolution with high sensitivity but lower specificity. Thus, these tools cannot fully replace functional characterization but can aid in reducing the number of variants that may require functional analysis.
期刊介绍:
Circulation: Genomic and Precision Medicine is a distinguished journal dedicated to advancing the frontiers of cardiovascular genomics and precision medicine. It publishes a diverse array of original research articles that delve into the genetic and molecular underpinnings of cardiovascular diseases. The journal's scope is broad, encompassing studies from human subjects to laboratory models, and from in vitro experiments to computational simulations.
Circulation: Genomic and Precision Medicine is committed to publishing studies that have direct relevance to human cardiovascular biology and disease, with the ultimate goal of improving patient care and outcomes. The journal serves as a platform for researchers to share their groundbreaking work, fostering collaboration and innovation in the field of cardiovascular genomics and precision medicine.