利用异质网络和增强负抽样预测tf靶基因关联。

IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS
Bioinformatics and Biology Insights Pub Date : 2025-02-25 eCollection Date: 2025-01-01 DOI:10.1177/11779322251316130
Thanh Tuoi Le, Xuan Tho Dang
{"title":"利用异质网络和增强负抽样预测tf靶基因关联。","authors":"Thanh Tuoi Le, Xuan Tho Dang","doi":"10.1177/11779322251316130","DOIUrl":null,"url":null,"abstract":"<p><p>Identifying interactions between transcription factors (TFs) and target genes is crucial for understanding the molecular mechanisms involved in biological processes and diseases. Traditional biological experiments used to determine these interactions are often time-consuming, costly, and limited in scale. Current computational methods mainly predict binding sites rather than direct interactions. Although recent studies have achieved high performance in predicting TF-target gene associations, they still face a significant challenge related to constructing a robust dataset of positive and negative samples. Currently, methods do not adequately focus on selecting negative samples, resulting in incomplete coverage of potential TF-target gene relationships. This article proposes a method to select enhanced negative samples to improve the prediction performance of TF-target gene interactions. Experimental results show that the proposed method achieves an average area under the curve (AUC) value of 0.9024 ± 0.0008 through 5-fold cross-validation. These results demonstrate the model's high efficiency and accuracy, confirming its potential application in predicting TF-target gene interactions across various datasets and paving the way for large-scale biomedical research.</p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"19 ","pages":"11779322251316130"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863233/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting TF-Target Gene Association Using a Heterogeneous Network and Enhanced Negative Sampling.\",\"authors\":\"Thanh Tuoi Le, Xuan Tho Dang\",\"doi\":\"10.1177/11779322251316130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Identifying interactions between transcription factors (TFs) and target genes is crucial for understanding the molecular mechanisms involved in biological processes and diseases. Traditional biological experiments used to determine these interactions are often time-consuming, costly, and limited in scale. Current computational methods mainly predict binding sites rather than direct interactions. Although recent studies have achieved high performance in predicting TF-target gene associations, they still face a significant challenge related to constructing a robust dataset of positive and negative samples. Currently, methods do not adequately focus on selecting negative samples, resulting in incomplete coverage of potential TF-target gene relationships. This article proposes a method to select enhanced negative samples to improve the prediction performance of TF-target gene interactions. Experimental results show that the proposed method achieves an average area under the curve (AUC) value of 0.9024 ± 0.0008 through 5-fold cross-validation. These results demonstrate the model's high efficiency and accuracy, confirming its potential application in predicting TF-target gene interactions across various datasets and paving the way for large-scale biomedical research.</p>\",\"PeriodicalId\":9065,\"journal\":{\"name\":\"Bioinformatics and Biology Insights\",\"volume\":\"19 \",\"pages\":\"11779322251316130\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-02-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863233/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics and Biology Insights\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/11779322251316130\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics and Biology Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11779322251316130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

确定转录因子(TFs)和靶基因之间的相互作用对于理解参与生物过程和疾病的分子机制至关重要。用于确定这些相互作用的传统生物学实验通常耗时、昂贵且规模有限。目前的计算方法主要是预测结合位点而不是直接相互作用。尽管最近的研究在预测tf靶基因关联方面取得了很高的成绩,但他们仍然面临着与构建一个强大的阳性和阴性样本数据集相关的重大挑战。目前,方法没有充分关注选择阴性样本,导致潜在tf靶基因关系的不完全覆盖。本文提出了一种选择增强负样本的方法,以提高tf -靶基因相互作用的预测性能。实验结果表明,通过5次交叉验证,该方法的平均曲线下面积(AUC)值为0.9024±0.0008。这些结果证明了该模型的高效率和准确性,证实了其在预测tf靶基因相互作用方面的潜在应用,并为大规模生物医学研究铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting TF-Target Gene Association Using a Heterogeneous Network and Enhanced Negative Sampling.

Identifying interactions between transcription factors (TFs) and target genes is crucial for understanding the molecular mechanisms involved in biological processes and diseases. Traditional biological experiments used to determine these interactions are often time-consuming, costly, and limited in scale. Current computational methods mainly predict binding sites rather than direct interactions. Although recent studies have achieved high performance in predicting TF-target gene associations, they still face a significant challenge related to constructing a robust dataset of positive and negative samples. Currently, methods do not adequately focus on selecting negative samples, resulting in incomplete coverage of potential TF-target gene relationships. This article proposes a method to select enhanced negative samples to improve the prediction performance of TF-target gene interactions. Experimental results show that the proposed method achieves an average area under the curve (AUC) value of 0.9024 ± 0.0008 through 5-fold cross-validation. These results demonstrate the model's high efficiency and accuracy, confirming its potential application in predicting TF-target gene interactions across various datasets and paving the way for large-scale biomedical research.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Bioinformatics and Biology Insights
Bioinformatics and Biology Insights BIOCHEMICAL RESEARCH METHODS-
CiteScore
6.80
自引率
1.70%
发文量
36
审稿时长
8 weeks
期刊介绍: Bioinformatics and Biology Insights is an open access, peer-reviewed journal that considers articles on bioinformatics methods and their applications which must pertain to biological insights. All papers should be easily amenable to biologists and as such help bridge the gap between theories and applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信