Easy-PSAP: an integrated workflow to prioritize pathogenic variants in sequence data from a single individual.

IF 1.1 4区 生物学 Q4 GENETICS & HEREDITY
Human Heredity Pub Date : 2025-06-10 DOI:10.1159/000543671
Marie-Sophie C Ogloblinsky, Marc B Gros-La-Faige, Daniel P Lewinsohn, Mathilde Nguyen, Lourdes Velo-Suarez, Anthony Herzig, Thomas E Ludwig, Helen Castillo-Madeen, Donald F Conrad, Emmanuelle Génin, Gaëlle Marenne
{"title":"Easy-PSAP: an integrated workflow to prioritize pathogenic variants in sequence data from a single individual.","authors":"Marie-Sophie C Ogloblinsky, Marc B Gros-La-Faige, Daniel P Lewinsohn, Mathilde Nguyen, Lourdes Velo-Suarez, Anthony Herzig, Thomas E Ludwig, Helen Castillo-Madeen, Donald F Conrad, Emmanuelle Génin, Gaëlle Marenne","doi":"10.1159/000543671","DOIUrl":null,"url":null,"abstract":"<p><p>Introduction Next-Generation Sequencing data analysis has become an integral part of clinical genetic diagnosis, raising the question of variant prioritization. The Population Sampling Probability (PSAP) method has been developed to tackle the issue of variant prioritization in the exome of a single patient, by leveraging allele frequencies from population databases and a variant pathogenicity score. Methods Here, we present Easy-PSAP, a completely new implementation of the PSAP method comprising two user-friendly and highly adaptable pipelines. Easy-PSAP allows the gene-based recalibration of any in silico pathogenicity prediction score compared to scores of variants seen in the general population, including popular scores like CADD or AlphaMissense. Easy-PSAP can evaluate genetic variants at the scale of a whole exome or a whole genome using information from the latest population and annotation databases. Results Through simulations on synthetic disease exomes, we show that Easy-PSAP is able to rank more than 50% of causal pathogenic variants in the top 10 variants for an autosomal dominant model of transmission and in the top 1 for an autosomal recessive model of transmission. Discussion These findings, along with the accessibility of the pipeline to both researchers and clinicians, make Easy-PSAP a state-of-the-art tool for variant prioritization in Next Generation Sequencing (NGS) data that can continue to evolve as new frameworks and databases become available. Easy-PSAP is implemented in R and bash within an open-source Snakemake framework. It is available on GitHub alongside conda environments containing the required dependencies (https://github.com/msogloblinsky/Easy-PSAP).</p>","PeriodicalId":13226,"journal":{"name":"Human Heredity","volume":" ","pages":"1-11"},"PeriodicalIF":1.1000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Heredity","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1159/000543671","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction Next-Generation Sequencing data analysis has become an integral part of clinical genetic diagnosis, raising the question of variant prioritization. The Population Sampling Probability (PSAP) method has been developed to tackle the issue of variant prioritization in the exome of a single patient, by leveraging allele frequencies from population databases and a variant pathogenicity score. Methods Here, we present Easy-PSAP, a completely new implementation of the PSAP method comprising two user-friendly and highly adaptable pipelines. Easy-PSAP allows the gene-based recalibration of any in silico pathogenicity prediction score compared to scores of variants seen in the general population, including popular scores like CADD or AlphaMissense. Easy-PSAP can evaluate genetic variants at the scale of a whole exome or a whole genome using information from the latest population and annotation databases. Results Through simulations on synthetic disease exomes, we show that Easy-PSAP is able to rank more than 50% of causal pathogenic variants in the top 10 variants for an autosomal dominant model of transmission and in the top 1 for an autosomal recessive model of transmission. Discussion These findings, along with the accessibility of the pipeline to both researchers and clinicians, make Easy-PSAP a state-of-the-art tool for variant prioritization in Next Generation Sequencing (NGS) data that can continue to evolve as new frameworks and databases become available. Easy-PSAP is implemented in R and bash within an open-source Snakemake framework. It is available on GitHub alongside conda environments containing the required dependencies (https://github.com/msogloblinsky/Easy-PSAP).

Easy-PSAP:一个集成的工作流程,优先考虑来自单个个体的序列数据中的致病变异。
新一代测序数据分析已经成为临床遗传诊断的一个组成部分,提出了变异优先排序的问题。群体抽样概率(PSAP)方法通过利用群体数据库中的等位基因频率和变异致病性评分来解决单个患者外显子组中变异优先级的问题。在这里,我们提出了Easy-PSAP,一种全新的PSAP方法实现,包括两个用户友好且适应性强的管道。Easy-PSAP允许对任何基于基因的计算机致病性预测评分进行重新校准,与普通人群中看到的变异评分进行比较,包括CADD或AlphaMissense等流行评分。Easy-PSAP可以利用来自最新种群和注释数据库的信息,在整个外显子组或整个基因组的尺度上评估遗传变异。通过对合成疾病外显子组的模拟,我们发现Easy-PSAP能够将50%以上的致病变异排在常染色体显性传播模式的前10位变异中,并将常染色体隐性传播模式的前1位变异中。这些发现,以及对研究人员和临床医生的可访问性,使Easy-PSAP成为下一代测序(NGS)数据中变异优先排序的最先进工具,随着新的框架和数据库的出现,该工具可以继续发展。Easy-PSAP是在一个开源的Snakemake框架内用R和bash实现的。它可以在GitHub上与包含所需依赖项的conda环境一起使用(https://github.com/msogloblinsky/Easy-PSAP)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Heredity
Human Heredity 生物-遗传学
CiteScore
2.50
自引率
0.00%
发文量
12
审稿时长
>12 weeks
期刊介绍: Gathering original research reports and short communications from all over the world, ''Human Heredity'' is devoted to methodological and applied research on the genetics of human populations, association and linkage analysis, genetic mechanisms of disease, and new methods for statistical genetics, for example, analysis of rare variants and results from next generation sequencing. The value of this information to many branches of medicine is shown by the number of citations the journal receives in fields ranging from immunology and hematology to epidemiology and public health planning, and the fact that at least 50% of all ''Human Heredity'' papers are still cited more than 8 years after publication (according to ISI Journal Citation Reports). Special issues on methodological topics (such as ‘Consanguinity and Genomics’ in 2014; ‘Analyzing Rare Variants in Complex Diseases’ in 2012) or reviews of advances in particular fields (‘Genetic Diversity in European Populations: Evolutionary Evidence and Medical Implications’ in 2014; ‘Genes and the Environment in Obesity’ in 2013) are published every year. Renowned experts in the field are invited to contribute to these special issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信