{"title":"评估用于老年骨关节炎机器学习预测模型的血液和尿液生物标记物:可行性研究","authors":"Jun-hee Kim","doi":"10.1016/j.cmpb.2025.108779","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Osteoarthritis (OA) is a common degenerative joint disease, particularly affecting individuals aged >50 years. It deteriorates quality of life and restricts physical activity in the elderly. Early diagnosis of OA is crucial for effective management, slowing disease progression, and alleviating symptoms.</div></div><div><h3>Objectives</h3><div>This study evaluated the feasibility of utilizing biomarkers derived from blood and urine in developing predictive models for OA diagnosis in the elderly population. Additionally, we compared the derived biomarker model with a model using standard blood and urine variables to assess the impact of the derived biomarkers on OA diagnosis.</div></div><div><h3>Methods</h3><div>Data from 10,743 participants were analyzed, including variables from blood and urine tests. Machine learning algorithms were used to develop the models. Derived biomarkers were identified based on the most significant features highlighted by Shapley Additive exPlanations (SHAP) analysis. The performance of models based on blood and urine biomarkers was compared with that of models based on derived biomarkers, and important variables were analyzed using SHAP.</div></div><div><h3>Results</h3><div>The support vector machine demonstrated the highest accuracy (0.6245) and F1 score (0.6232) for the blood dataset, whereas the random forest model achieved the best performance (0.5770) for the urine dataset. The derived biomarker model, which combined biomarkers of high importance from the best-performing models, showed improved predictive performance compared with the model using all blood and urine variables. The derived biomarker model achieved the highest performance metrics, with the logistic regression algorithm yielding an accuracy of 0.6450, precision of 0.6443, recall of 0.6450, and F1 score of 0.6430.</div></div><div><h3>Conclusions</h3><div>Biomarkers derived from routinely available blood and urine tests show promise for the early detection and comprehensive diagnosis of OA in older patients. These biomarkers are practical for clinical use, as they can be integrated into routine testing, potentially aiding early detection and improving patient outcomes.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"266 ","pages":"Article 108779"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluation of blood- and urine-derived biomarkers for machine learning prediction models of osteoarthritis in elderly patients: A feasibility study\",\"authors\":\"Jun-hee Kim\",\"doi\":\"10.1016/j.cmpb.2025.108779\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Osteoarthritis (OA) is a common degenerative joint disease, particularly affecting individuals aged >50 years. It deteriorates quality of life and restricts physical activity in the elderly. Early diagnosis of OA is crucial for effective management, slowing disease progression, and alleviating symptoms.</div></div><div><h3>Objectives</h3><div>This study evaluated the feasibility of utilizing biomarkers derived from blood and urine in developing predictive models for OA diagnosis in the elderly population. Additionally, we compared the derived biomarker model with a model using standard blood and urine variables to assess the impact of the derived biomarkers on OA diagnosis.</div></div><div><h3>Methods</h3><div>Data from 10,743 participants were analyzed, including variables from blood and urine tests. Machine learning algorithms were used to develop the models. Derived biomarkers were identified based on the most significant features highlighted by Shapley Additive exPlanations (SHAP) analysis. The performance of models based on blood and urine biomarkers was compared with that of models based on derived biomarkers, and important variables were analyzed using SHAP.</div></div><div><h3>Results</h3><div>The support vector machine demonstrated the highest accuracy (0.6245) and F1 score (0.6232) for the blood dataset, whereas the random forest model achieved the best performance (0.5770) for the urine dataset. The derived biomarker model, which combined biomarkers of high importance from the best-performing models, showed improved predictive performance compared with the model using all blood and urine variables. The derived biomarker model achieved the highest performance metrics, with the logistic regression algorithm yielding an accuracy of 0.6450, precision of 0.6443, recall of 0.6450, and F1 score of 0.6430.</div></div><div><h3>Conclusions</h3><div>Biomarkers derived from routinely available blood and urine tests show promise for the early detection and comprehensive diagnosis of OA in older patients. These biomarkers are practical for clinical use, as they can be integrated into routine testing, potentially aiding early detection and improving patient outcomes.</div></div>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"266 \",\"pages\":\"Article 108779\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169260725001968\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725001968","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Evaluation of blood- and urine-derived biomarkers for machine learning prediction models of osteoarthritis in elderly patients: A feasibility study
Background
Osteoarthritis (OA) is a common degenerative joint disease, particularly affecting individuals aged >50 years. It deteriorates quality of life and restricts physical activity in the elderly. Early diagnosis of OA is crucial for effective management, slowing disease progression, and alleviating symptoms.
Objectives
This study evaluated the feasibility of utilizing biomarkers derived from blood and urine in developing predictive models for OA diagnosis in the elderly population. Additionally, we compared the derived biomarker model with a model using standard blood and urine variables to assess the impact of the derived biomarkers on OA diagnosis.
Methods
Data from 10,743 participants were analyzed, including variables from blood and urine tests. Machine learning algorithms were used to develop the models. Derived biomarkers were identified based on the most significant features highlighted by Shapley Additive exPlanations (SHAP) analysis. The performance of models based on blood and urine biomarkers was compared with that of models based on derived biomarkers, and important variables were analyzed using SHAP.
Results
The support vector machine demonstrated the highest accuracy (0.6245) and F1 score (0.6232) for the blood dataset, whereas the random forest model achieved the best performance (0.5770) for the urine dataset. The derived biomarker model, which combined biomarkers of high importance from the best-performing models, showed improved predictive performance compared with the model using all blood and urine variables. The derived biomarker model achieved the highest performance metrics, with the logistic regression algorithm yielding an accuracy of 0.6450, precision of 0.6443, recall of 0.6450, and F1 score of 0.6430.
Conclusions
Biomarkers derived from routinely available blood and urine tests show promise for the early detection and comprehensive diagnosis of OA in older patients. These biomarkers are practical for clinical use, as they can be integrated into routine testing, potentially aiding early detection and improving patient outcomes.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.