{"title":"Machine learning approaches for predicting the small molecule-miRNA associations: a comprehensive review.","authors":"Ashish Panghalia, Vikram Singh","doi":"10.1007/s11030-025-11211-9","DOIUrl":null,"url":null,"abstract":"<p><p>MicroRNAs (miRNAs) are evolutionarily conserved small regulatory elements that are ubiquitous in cells and are found to be abnormally expressed during the onset and progression of several human diseases. miRNAs are increasingly recognized as potential diagnostic and therapeutic targets that could be inhibited by small molecules (SMs). The knowledge of SM-miRNA associations (SMAs) is sparse, mainly because of the dynamic and less predictable 3D structures of miRNAs that restrict the high-throughput screening of SMs. Toward augmenting the costly and laborious experiments determining the SM-miRNA interactions, machine learning (ML) has emerged as a cost-effective and efficient platform. In this article, various aspects associated with the ML-guided predictions of SMAs are thoroughly reviewed. Firstly, a detailed account of the SMA data resources useful for algorithms training is provided, followed by an elaboration of various feature extraction methods and similarity measures utilized on SMs and miRNAs. Subsequent to a summary of the ML algorithms basics and a brief description of the performance measures, an exhaustive census of all the 32 ML-based SMA prediction methods developed so far is outlined. Distinctive features of these methods have been described by classifying them into six broad categories, namely, classical ML, deep learning, matrix factorization, network propagation, graph learning, and ensemble learning methods. Trend analyses are performed to investigate the patterns in ML algorithms usage and performance achievement in SMA prediction. Outlining key principles behind the up-to-date methodologies and comparing their accomplishments, this review offers valuable insights into critical areas for future research in ML-based SMA prediction.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-025-11211-9","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
MicroRNAs (miRNAs) are evolutionarily conserved small regulatory elements that are ubiquitous in cells and are found to be abnormally expressed during the onset and progression of several human diseases. miRNAs are increasingly recognized as potential diagnostic and therapeutic targets that could be inhibited by small molecules (SMs). The knowledge of SM-miRNA associations (SMAs) is sparse, mainly because of the dynamic and less predictable 3D structures of miRNAs that restrict the high-throughput screening of SMs. Toward augmenting the costly and laborious experiments determining the SM-miRNA interactions, machine learning (ML) has emerged as a cost-effective and efficient platform. In this article, various aspects associated with the ML-guided predictions of SMAs are thoroughly reviewed. Firstly, a detailed account of the SMA data resources useful for algorithms training is provided, followed by an elaboration of various feature extraction methods and similarity measures utilized on SMs and miRNAs. Subsequent to a summary of the ML algorithms basics and a brief description of the performance measures, an exhaustive census of all the 32 ML-based SMA prediction methods developed so far is outlined. Distinctive features of these methods have been described by classifying them into six broad categories, namely, classical ML, deep learning, matrix factorization, network propagation, graph learning, and ensemble learning methods. Trend analyses are performed to investigate the patterns in ML algorithms usage and performance achievement in SMA prediction. Outlining key principles behind the up-to-date methodologies and comparing their accomplishments, this review offers valuable insights into critical areas for future research in ML-based SMA prediction.
期刊介绍:
Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including:
combinatorial chemistry and parallel synthesis;
small molecule libraries;
microwave synthesis;
flow synthesis;
fluorous synthesis;
diversity oriented synthesis (DOS);
nanoreactors;
click chemistry;
multiplex technologies;
fragment- and ligand-based design;
structure/function/SAR;
computational chemistry and molecular design;
chemoinformatics;
screening techniques and screening interfaces;
analytical and purification methods;
robotics, automation and miniaturization;
targeted libraries;
display libraries;
peptides and peptoids;
proteins;
oligonucleotides;
carbohydrates;
natural diversity;
new methods of library formulation and deconvolution;
directed evolution, origin of life and recombination;
search techniques, landscapes, random chemistry and more;