Open-Source Bioinformatic Pipeline to Improve PMS2 Genetic Testing Using Short-Read NGS Data

IF 3.4 3区 医学 Q1 PATHOLOGY
{"title":"Open-Source Bioinformatic Pipeline to Improve PMS2 Genetic Testing Using Short-Read NGS Data","authors":"","doi":"10.1016/j.jmoldx.2024.05.005","DOIUrl":null,"url":null,"abstract":"<div><p>The molecular diagnosis of mismatch repair–deficient cancer syndromes is hampered by difficulties in sequencing the <em>PMS2</em> gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for <em>PMS2</em> mutational analysis and explore <em>PMS2</em> germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. <em>PMS2</em> mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with <em>PMS2</em> (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the <em>PMS2</em> reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956, respectively) in the validation cohort, identifying all previously <em>PMS2</em> pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic <em>PMS2</em> variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved <em>PMS2</em> mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining <em>PMS2</em> screening.</p></div>","PeriodicalId":50128,"journal":{"name":"Journal of Molecular Diagnostics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1525157824001181/pdfft?md5=91683b4368460b158919e0c187d84829&pid=1-s2.0-S1525157824001181-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1525157824001181","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The molecular diagnosis of mismatch repair–deficient cancer syndromes is hampered by difficulties in sequencing the PMS2 gene, mainly owing to the PMS2CL pseudogene. Next-generation sequencing short reads cannot be mapped unambiguously by standard pipelines, compromising variant calling accuracy. This study aimed to provide a refined bioinformatic pipeline for PMS2 mutational analysis and explore PMS2 germline pathogenic variant prevalence in an unselected hereditary cancer (HC) cohort. PMS2 mutational analysis was optimized using two cohorts: 192 unselected HC patients for assessing the allelic ratio of paralogous sequence variants, and 13 samples enriched with PMS2 (likely) pathogenic variants screened previously by long-range genomic DNA PCR amplification. Reads were forced to align with the PMS2 reference sequence, except those corresponding to exon 11, where only those intersecting gene-specific invariant positions were considered. Afterward, the refined pipeline's accuracy was validated in a cohort of 40 patients and used to screen 5619 HC patients. Compared with our routine diagnostic pipeline, the PMS2_vaR pipeline showed increased technical sensitivity (0.853 to 0.956, respectively) in the validation cohort, identifying all previously PMS2 pathogenic variants found by long-range genomic DNA PCR amplification. Fifteen HC cohort samples carried a pathogenic PMS2 variant (15 of 5619; 0.285%), doubling the estimated prevalence in the general population. The refined open-source approach improved PMS2 mutational analysis accuracy, allowing its inclusion in the routine next-generation sequencing pipeline streamlining PMS2 screening.

利用短读数 NGS 数据改进 PMS2 基因测试的开源生物信息学管道。
主要由于 PMS2CL 假基因导致的 PMS2 基因测序困难阻碍了 MMR 缺失癌症综合征的分子诊断。下一代测序(NGS)短读数无法通过标准管道进行明确映射,从而影响了变异调用的准确性。我们的目标是为 PMS2 基因突变分析提供一个完善的生物信息学管道,并在一个未选择的遗传性癌症(HC)队列中探索 PMS2 基因致病变体的流行率。利用两个队列对 PMS2 基因突变分析进行了优化:192例未经筛选的HC患者用于评估旁系序列变异的等位基因比例,13例样本富含PMS2(可能)致病变异,这些变异之前已通过长程gDNA PCR扩增(LR-PCR)进行了筛查。读数被强制与 PMS2 参考序列比对,但与第 11 号外显子对应的读数除外,只考虑与基因特异性不变位置相交的读数。之后,在 40 名患者的队列中验证了改进管道的准确性,并用于筛选 5619 名 HC 患者。与我们的常规诊断管道相比,PMS2_vaR 管道在验证队列中显示出更高的技术灵敏度(从 0.853 到 0.956),能识别出以前通过 LR-PCR 发现的所有 PMS2 致病变异。15个HC队列样本携带致病性PMS2变体(15/5619,0.285%),比一般人群的估计患病率高出一倍。改进后的开源方法提高了 PMS2 突变分析的准确性,使其能够纳入常规 NGS 管线,简化 PMS2 筛查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.10
自引率
2.40%
发文量
143
审稿时长
43 days
期刊介绍: The Journal of Molecular Diagnostics, the official publication of the Association for Molecular Pathology (AMP), co-owned by the American Society for Investigative Pathology (ASIP), seeks to publish high quality original papers on scientific advances in the translation and validation of molecular discoveries in medicine into the clinical diagnostic setting, and the description and application of technological advances in the field of molecular diagnostic medicine. The editors welcome for review articles that contain: novel discoveries or clinicopathologic correlations including studies in oncology, infectious diseases, inherited diseases, predisposition to disease, clinical informatics, or the description of polymorphisms linked to disease states or normal variations; the application of diagnostic methodologies in clinical trials; or the development of new or improved molecular methods which may be applied to diagnosis or monitoring of disease or disease predisposition.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信