Michael Cabanillas-Carbonell , Joselyn Zapata-Paulini
{"title":"Evaluation of machine learning models for the prediction of Alzheimer's: In search of the best performance","authors":"Michael Cabanillas-Carbonell , Joselyn Zapata-Paulini","doi":"10.1016/j.bbih.2025.100957","DOIUrl":null,"url":null,"abstract":"<div><div>Alzheimer's is a progressive and degenerative disease affecting millions worldwide, incapacitating them physically and cognitively. This study aims to perform a comparative analysis of Machine Learning models to determine the model with the best performance in predicting Alzheimer's disease. The models used were Random Forest (RF), Adaptive Boosting (AdaBoost), Support Vector Machine (SVM), K-nearest Neighbors (KNN), and Logistic Regression (LR). Two datasets called OASIS were used to train the models, the first one had a total of 436 records and 12 variables, while the second one stored 373 records and 15 variables. The article's content is divided into six main sections: introduction, literature review, methodological approach, results, discussions, and conclusions. After processing and pooling the datasets, RF, SVM, and LR proved the best predictors, achieving 96% accuracy, precision, sensitivity, and F1 score. This study highlights the efficacy of RF, SVM, and LR in predicting Alzheimer's disease, offering a significant advance toward understanding and management of this disease, which supports the relevance of implementing these models in future research and clinical applications.</div></div>","PeriodicalId":72454,"journal":{"name":"Brain, behavior, & immunity - health","volume":"44 ","pages":"Article 100957"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain, behavior, & immunity - health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666354625000158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Alzheimer's is a progressive and degenerative disease affecting millions worldwide, incapacitating them physically and cognitively. This study aims to perform a comparative analysis of Machine Learning models to determine the model with the best performance in predicting Alzheimer's disease. The models used were Random Forest (RF), Adaptive Boosting (AdaBoost), Support Vector Machine (SVM), K-nearest Neighbors (KNN), and Logistic Regression (LR). Two datasets called OASIS were used to train the models, the first one had a total of 436 records and 12 variables, while the second one stored 373 records and 15 variables. The article's content is divided into six main sections: introduction, literature review, methodological approach, results, discussions, and conclusions. After processing and pooling the datasets, RF, SVM, and LR proved the best predictors, achieving 96% accuracy, precision, sensitivity, and F1 score. This study highlights the efficacy of RF, SVM, and LR in predicting Alzheimer's disease, offering a significant advance toward understanding and management of this disease, which supports the relevance of implementing these models in future research and clinical applications.