病原同义突变的特征驱动预测模型。

IF 5.3 2区化学 Q1 CHEMISTRY, MEDICINAL

Journal of Chemical Information and Modeling Pub Date : 2025-03-24 Epub Date: 2025-03-13 DOI:10.1021/acs.jcim.4c02139

Fangfang Jin, Na Cheng, Lihua Wang, Bin Ye, Junfeng Xia

{"title":"病原同义突变的特征驱动预测模型。","authors":"Fangfang Jin, Na Cheng, Lihua Wang, Bin Ye, Junfeng Xia","doi":"10.1021/acs.jcim.4c02139","DOIUrl":null,"url":null,"abstract":"Synonymous mutations, once considered to be biologically neutral, are now recognized to affect protein expression and function by altering the RNA splicing, stability, or translation efficiency. These effects can contribute to disease, making the prediction of the pathogenicity a crucial task. Computational methods have been developed to analyze the sequence features and biological functions of synonymous mutations, but existing methods face limitations, including scarcity of labeled data, reliance on other prediction tools, and insufficient representation of feature interrelationships. Here, we present FDPSM, a novel prediction method specifically designed to predict pathogenic synonymous mutations. FDPSM was trained on a robust data set of 4251 positive and negative training samples to enhance predictive accuracy. The method leveraged a comprehensive set of features, including genomic context, conservation, splicing effects, functional effects, and epigenomics, without relying on prediction scores from other mutation pathogenicity tools. Recognizing that original features alone may not fully capture the distinctions between pathogenic and benign synonymous mutations, we enhanced the feature set by extracting effective information from the interactions and distribution of these features. The experimental results showed that FDPSM significantly outperformed existing methods in predicting the pathogenicity of synonymous mutations, offering a more accurate and reliable tool for this important task. FDPSM is available at https://github.com/xialab-ahu/FDPSM.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"3064-3076"},"PeriodicalIF":5.3000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FDPSM: Feature-Driven Prediction Modeling of Pathogenic Synonymous Mutations.\",\"authors\":\"Fangfang Jin, Na Cheng, Lihua Wang, Bin Ye, Junfeng Xia\",\"doi\":\"10.1021/acs.jcim.4c02139\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Synonymous mutations, once considered to be biologically neutral, are now recognized to affect protein expression and function by altering the RNA splicing, stability, or translation efficiency. These effects can contribute to disease, making the prediction of the pathogenicity a crucial task. Computational methods have been developed to analyze the sequence features and biological functions of synonymous mutations, but existing methods face limitations, including scarcity of labeled data, reliance on other prediction tools, and insufficient representation of feature interrelationships. Here, we present FDPSM, a novel prediction method specifically designed to predict pathogenic synonymous mutations. FDPSM was trained on a robust data set of 4251 positive and negative training samples to enhance predictive accuracy. The method leveraged a comprehensive set of features, including genomic context, conservation, splicing effects, functional effects, and epigenomics, without relying on prediction scores from other mutation pathogenicity tools. Recognizing that original features alone may not fully capture the distinctions between pathogenic and benign synonymous mutations, we enhanced the feature set by extracting effective information from the interactions and distribution of these features. The experimental results showed that FDPSM significantly outperformed existing methods in predicting the pathogenicity of synonymous mutations, offering a more accurate and reliable tool for this important task. FDPSM is available at https://github.com/xialab-ahu/FDPSM.\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"3064-3076\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.4c02139\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/13 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c02139","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}

引用次数: 0

摘要

同义突变，曾经被认为是生物学上中性的，现在被认为是通过改变RNA剪接、稳定性或翻译效率来影响蛋白质的表达和功能。这些作用可能导致疾病，因此预测致病性是一项至关重要的任务。计算方法已被开发用于分析同义突变的序列特征和生物学功能，但现有方法面临局限性，包括标记数据的稀缺性，依赖于其他预测工具，以及特征相互关系的不充分表示。在这里，我们提出了FDPSM，一种专门用于预测致病性同义突变的新型预测方法。FDPSM在4251个阳性和阴性训练样本的鲁棒数据集上进行训练，以提高预测准确性。该方法利用了一套全面的特征，包括基因组背景、保守性、剪接效应、功能效应和表观基因组学，而不依赖于其他突变致病性工具的预测评分。认识到单独的原始特征可能无法完全捕获致病和良性同义突变之间的区别，我们通过从这些特征的相互作用和分布中提取有效信息来增强特征集。实验结果表明，FDPSM在预测同义突变致病性方面明显优于现有方法，为这一重要任务提供了更准确、更可靠的工具。FDPSM可在https://github.com/xialab-ahu/FDPSM上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

FDPSM: Feature-Driven Prediction Modeling of Pathogenic Synonymous Mutations.

Synonymous mutations, once considered to be biologically neutral, are now recognized to affect protein expression and function by altering the RNA splicing, stability, or translation efficiency. These effects can contribute to disease, making the prediction of the pathogenicity a crucial task. Computational methods have been developed to analyze the sequence features and biological functions of synonymous mutations, but existing methods face limitations, including scarcity of labeled data, reliance on other prediction tools, and insufficient representation of feature interrelationships. Here, we present FDPSM, a novel prediction method specifically designed to predict pathogenic synonymous mutations. FDPSM was trained on a robust data set of 4251 positive and negative training samples to enhance predictive accuracy. The method leveraged a comprehensive set of features, including genomic context, conservation, splicing effects, functional effects, and epigenomics, without relying on prediction scores from other mutation pathogenicity tools. Recognizing that original features alone may not fully capture the distinctions between pathogenic and benign synonymous mutations, we enhanced the feature set by extracting effective information from the interactions and distribution of these features. The experimental results showed that FDPSM significantly outperformed existing methods in predicting the pathogenicity of synonymous mutations, offering a more accurate and reliable tool for this important task. FDPSM is available at https://github.com/xialab-ahu/FDPSM.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Chemical Information and Modeling 化学-化学综合

CiteScore

9.80

自引率

10.70%

发文量

529

审稿时长

1.4 months

期刊介绍： The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.