Predicting the Functional Impact of KCNQ1 Variants of Unknown Significance.

Bian Li, Jeffrey L Mendenhall, Brett M Kroncke, Keenan C Taylor, Hui Huang, Derek K Smith, Carlos G Vanoye, Jeffrey D Blume, Alfred L George, Charles R Sanders, Jens Meiler
{"title":"Predicting the Functional Impact of KCNQ1 Variants of Unknown Significance.","authors":"Bian Li,&nbsp;Jeffrey L Mendenhall,&nbsp;Brett M Kroncke,&nbsp;Keenan C Taylor,&nbsp;Hui Huang,&nbsp;Derek K Smith,&nbsp;Carlos G Vanoye,&nbsp;Jeffrey D Blume,&nbsp;Alfred L George,&nbsp;Charles R Sanders,&nbsp;Jens Meiler","doi":"10.1161/CIRCGENETICS.117.001754","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>An emerging standard-of-care for long-QT syndrome uses clinical genetic testing to identify genetic variants of the KCNQ1 potassium channel. However, interpreting results from genetic testing is confounded by the presence of variants of unknown significance for which there is inadequate evidence of pathogenicity.</p><p><strong>Methods and results: </strong>In this study, we curated from the literature a high-quality set of 107 functionally characterized KCNQ1 variants. Based on this data set, we completed a detailed quantitative analysis on the sequence conservation patterns of subdomains of KCNQ1 and the distribution of pathogenic variants therein. We found that conserved subdomains generally are critical for channel function and are enriched with dysfunctional variants. Using this experimentally validated data set, we trained a neural network, designated Q1VarPred, specifically for predicting the functional impact of KCNQ1 variants of unknown significance. The estimated predictive performance of Q1VarPred in terms of Matthew's correlation coefficient and area under the receiver operating characteristic curve were 0.581 and 0.884, respectively, superior to the performance of 8 previous methods tested in parallel. Q1VarPred is publicly available as a web server at http://meilerlab.org/q1varpred.</p><p><strong>Conclusions: </strong>Although a plethora of tools are available for making pathogenicity predictions over a genome-wide scale, previous tools fail to perform in a robust manner when applied to KCNQ1. The contrasting and favorable results for Q1VarPred suggest a promising approach, where a machine-learning algorithm is tailored to a specific protein target and trained with a functionally validated data set to calibrate informatics tools.</p>","PeriodicalId":10277,"journal":{"name":"Circulation: Cardiovascular Genetics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1161/CIRCGENETICS.117.001754","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circulation: Cardiovascular Genetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1161/CIRCGENETICS.117.001754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 34

Abstract

Background: An emerging standard-of-care for long-QT syndrome uses clinical genetic testing to identify genetic variants of the KCNQ1 potassium channel. However, interpreting results from genetic testing is confounded by the presence of variants of unknown significance for which there is inadequate evidence of pathogenicity.

Methods and results: In this study, we curated from the literature a high-quality set of 107 functionally characterized KCNQ1 variants. Based on this data set, we completed a detailed quantitative analysis on the sequence conservation patterns of subdomains of KCNQ1 and the distribution of pathogenic variants therein. We found that conserved subdomains generally are critical for channel function and are enriched with dysfunctional variants. Using this experimentally validated data set, we trained a neural network, designated Q1VarPred, specifically for predicting the functional impact of KCNQ1 variants of unknown significance. The estimated predictive performance of Q1VarPred in terms of Matthew's correlation coefficient and area under the receiver operating characteristic curve were 0.581 and 0.884, respectively, superior to the performance of 8 previous methods tested in parallel. Q1VarPred is publicly available as a web server at http://meilerlab.org/q1varpred.

Conclusions: Although a plethora of tools are available for making pathogenicity predictions over a genome-wide scale, previous tools fail to perform in a robust manner when applied to KCNQ1. The contrasting and favorable results for Q1VarPred suggest a promising approach, where a machine-learning algorithm is tailored to a specific protein target and trained with a functionally validated data set to calibrate informatics tools.

Abstract Image

Abstract Image

Abstract Image

预测未知意义的KCNQ1变异的功能影响。
背景:一种新兴的长qt综合征护理标准使用临床基因检测来识别KCNQ1钾通道的遗传变异。然而,对基因检测结果的解释因存在意义未知的变异而混淆,这些变异的致病性证据不足。方法和结果:在本研究中,我们从文献中筛选了107个功能性表征的KCNQ1变异。基于该数据集,我们完成了对KCNQ1亚结构域序列保守模式及其致病变异分布的详细定量分析。我们发现保守子结构域通常对通道功能至关重要,并且富含功能失调的变体。利用这个实验验证的数据集,我们训练了一个神经网络,命名为Q1VarPred,专门用于预测未知意义的KCNQ1变异的功能影响。Q1VarPred的马修相关系数和受试者工作特征曲线下面积的估计预测性能分别为0.581和0.884,优于之前并行测试的8种方法。Q1VarPred作为web服务器在http://meilerlab.org/q1varpred.Conclusions:上公开可用,尽管有大量的工具可用于在全基因组范围内进行致病性预测,但以前的工具在应用于KCNQ1时无法以稳健的方式执行。Q1VarPred的对比结果表明,这是一种很有前途的方法,其中机器学习算法针对特定的蛋白质靶标进行定制,并使用功能验证的数据集进行训练,以校准信息学工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Circulation: Cardiovascular Genetics
Circulation: Cardiovascular Genetics CARDIAC & CARDIOVASCULAR SYSTEMS-GENETICS & HEREDITY
自引率
0.00%
发文量
0
审稿时长
6-12 weeks
期刊介绍: Circulation: Genomic and Precision Medicine considers all types of original research articles, including studies conducted in human subjects, laboratory animals, in vitro, and in silico. Articles may include investigations of: clinical genetics as applied to the diagnosis and management of monogenic or oligogenic cardiovascular disorders; the molecular basis of complex cardiovascular disorders, including genome-wide association studies, exome and genome sequencing-based association studies, coding variant association studies, genetic linkage studies, epigenomics, transcriptomics, proteomics, metabolomics, and metagenomics; integration of electronic health record data or patient-generated data with any of the aforementioned approaches, including phenome-wide association studies, or with environmental or lifestyle factors; pharmacogenomics; regulation of gene expression; gene therapy and therapeutic genomic editing; systems biology approaches to the diagnosis and management of cardiovascular disorders; novel methods to perform any of the aforementioned studies; and novel applications of precision medicine. Above all, we seek studies with relevance to human cardiovascular biology and disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信