Aliaksandra Sikirzhytskaya , Ilya Tyagin , S. Scott Sutton , Michael D. Wyatt , Ilya Safro , Michael Shtutman
{"title":"AI-based mining of biomedical literature: Applications for drug repurposing for the treatment of dementia","authors":"Aliaksandra Sikirzhytskaya , Ilya Tyagin , S. Scott Sutton , Michael D. Wyatt , Ilya Safro , Michael Shtutman","doi":"10.1016/j.artmed.2025.103218","DOIUrl":null,"url":null,"abstract":"<div><div>Neurodegenerative diseases like Alzheimer's, Parkinson's, and HIV-associated neurocognitive disorder severely impact patients and healthcare systems. While effective treatments remain limited, researchers are actively developing ways to slow progression and improve patient outcomes, requiring innovative approaches to handle huge volumes of new scientific data. To enable the automatic analysis of biomedical data we introduced AGATHA, an effective AI-based literature mining tool that can navigate massive scientific literature databases. The overarching goal of this effort is to adapt AGATHA for drug repurposing by revealing hidden connections between FDA-approved medications and a health condition of interest. Our tool converts the abstracts of peer-reviewed papers from PubMed into multidimensional space where each gene and health condition are represented by specific metrics. We implemented advanced statistical analysis to reveal distinct clusters of scientific terms within the virtual space created using AGATHA-calculated parameters for selected health conditions and genes. Partial Least Squares Discriminant Analysis was employed for categorizing and predicting samples (122 diseases and 20,889 genes) fitted to specific classes. Advanced statistics were employed to build a discrimination model and extract lists of genes specific to each disease class. We focused on repurposing drugs for dementia by identifying dementia-associated genes highly ranked in other disease classes. The method was developed for detection of genes that shared across multiple conditions and classified them based on their roles in biological pathways. This led to the selection of six primary drugs for further study.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"168 ","pages":"Article 103218"},"PeriodicalIF":6.2000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725001538","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Neurodegenerative diseases like Alzheimer's, Parkinson's, and HIV-associated neurocognitive disorder severely impact patients and healthcare systems. While effective treatments remain limited, researchers are actively developing ways to slow progression and improve patient outcomes, requiring innovative approaches to handle huge volumes of new scientific data. To enable the automatic analysis of biomedical data we introduced AGATHA, an effective AI-based literature mining tool that can navigate massive scientific literature databases. The overarching goal of this effort is to adapt AGATHA for drug repurposing by revealing hidden connections between FDA-approved medications and a health condition of interest. Our tool converts the abstracts of peer-reviewed papers from PubMed into multidimensional space where each gene and health condition are represented by specific metrics. We implemented advanced statistical analysis to reveal distinct clusters of scientific terms within the virtual space created using AGATHA-calculated parameters for selected health conditions and genes. Partial Least Squares Discriminant Analysis was employed for categorizing and predicting samples (122 diseases and 20,889 genes) fitted to specific classes. Advanced statistics were employed to build a discrimination model and extract lists of genes specific to each disease class. We focused on repurposing drugs for dementia by identifying dementia-associated genes highly ranked in other disease classes. The method was developed for detection of genes that shared across multiple conditions and classified them based on their roles in biological pathways. This led to the selection of six primary drugs for further study.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.