{"title":"Predicting disease associations based on the higher order structure of ceRNA networks.","authors":"Zhaoliang Chai, Ying Su, Xuecong Tian, Chen Chen, Xiaoyi Lv, Cheng Chen","doi":"10.1093/bib/bbaf518","DOIUrl":null,"url":null,"abstract":"<p><p>Competitive endogenous RNA (ceRNA) network regulation is an important posttranscriptional regulatory mechanism that plays an important role in physiological and pathological processes, and has been widely used in biomarker screening and regulatory factor studies of disease-related genes. However, existing studies have mainly focused on the association of a single type of RNA with disease, while studies targeting the application of ceRNA networks in disease prediction are still limited, so it is crucial to explore the potential of ceRNA networks in disease prediction. In this study, we propose CERDA-HOSR, a computational method for mining ceRNA network-disease associations based on higher order graph attention networks. The method uses higher order graph convolutional networks to aggregate neighborhood information to generate representations of different RNAs and diseases. Given the higher order complexity of biological networks and sample imbalance problem, traditional random negative sampling is difficult to effectively capture global information; for this reason, a higher order negative sampling strategy is designed to optimize the quality of negative samples by combining the network structure and higher order neighborhood relations to improve the generalization ability and prediction accuracy of the model. Finally, LightGBM calculates the ceRNA network-disease association probability based on the learned embedding. A large number of simulation experiments validate the superiority of CERDA-HOSR, and its practical application is further demonstrated by case studies of cardiovascular disease, acute myeloid leukemia, and papillary thyroid cancer. In addition, ablation experiments and exploratory analyses further enhance its robustness and provide an effective tool for disease prediction and biomarker screening.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 5","pages":""},"PeriodicalIF":7.7000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495994/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf518","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Competitive endogenous RNA (ceRNA) network regulation is an important posttranscriptional regulatory mechanism that plays an important role in physiological and pathological processes, and has been widely used in biomarker screening and regulatory factor studies of disease-related genes. However, existing studies have mainly focused on the association of a single type of RNA with disease, while studies targeting the application of ceRNA networks in disease prediction are still limited, so it is crucial to explore the potential of ceRNA networks in disease prediction. In this study, we propose CERDA-HOSR, a computational method for mining ceRNA network-disease associations based on higher order graph attention networks. The method uses higher order graph convolutional networks to aggregate neighborhood information to generate representations of different RNAs and diseases. Given the higher order complexity of biological networks and sample imbalance problem, traditional random negative sampling is difficult to effectively capture global information; for this reason, a higher order negative sampling strategy is designed to optimize the quality of negative samples by combining the network structure and higher order neighborhood relations to improve the generalization ability and prediction accuracy of the model. Finally, LightGBM calculates the ceRNA network-disease association probability based on the learned embedding. A large number of simulation experiments validate the superiority of CERDA-HOSR, and its practical application is further demonstrated by case studies of cardiovascular disease, acute myeloid leukemia, and papillary thyroid cancer. In addition, ablation experiments and exploratory analyses further enhance its robustness and provide an effective tool for disease prediction and biomarker screening.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.